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BACKGROUND 



INTRODUCTION TO THE STUDIES 

In the Spring of 1966, the first prospective national evaluation 
of the effects of participation in full-year center- based Head Start 
programs was planned "by the Research and Evaluation Office of Project 
Head Start, The evaluation was designed to identify what kind of 
program would be most effective for different children. In 196? and 

1968 the evaluation was continued and extended, with increasingly 
broad measurement of program characteristics and child development, 
and with designs that increasingly specified what kinds of program 
variations would be studied. The data were collected during these 
three years (1966, 1967, 1968) by a network of fourteen university- 
based evaluation and research centers, prepared for data analysis in 

1969 by World Systems, Inc. (Contract OEO-B-99-502^) and independently 
analyzed during 1971-1972 by System Development Corporation (SDC, 
1972a; SDC, 1972b) and Research Triangle Institute (RTI, 1972). The 
approach of having two teams independently address the same issue was 
adopted to reduce the controversy so often directed at the analytic 
methodology of major evaluation studies. The purpose of this report 

is to summarize the findings from these two technical reports on the 
immediate changes in child development associated with Head Start, 
and to present a statement of the extent to which these changes are 
enhanced or attenuated by different program experiences for different 
children. 

The studies (i.e., the separate evaluations for 1966, 1967 and 
1968) were designed to identify immediate changes associated with 
Head Start participation as well as the conditions under which these 
changes were greatest. There were no control groups of eligible 
children who did not attend Head Start since addressing the issue of 
what kinds of clsissroom experiences "work best" for different 
children, required comparisons within the Head Start sample and not 
between Head Start and non-Head Start preschool populations. 

While the studies addressed only a limited airray of the many 
questions asked about early education and Head Start, the data 
nevertheless represent a rich and varied source of information on the 
child in relation to his classroom experience. The questions asked 
were: 



.What are Head Start children and families like and how do 
they differ from one another? 

.How do background characteristics of Head Start families 
and children influence children's initial performance? 
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.What are Head Start programs like? 

• What chan^Jces in children's social adjustment, motivation, 
scholastic readiness and cognitive performance are 
associated with participation in Head Start? 

•What changes in parent attitudes, particularly toward 
educatijn and society, are associated with children's 
participation in Head Start? 

.How do background characteristics of Head Start families and 
children influence the gains children make in the program? 

.How do teacher and program characteristics influence the 
gains children maJce in the program? 

.What are the inter-relationships between different teacher 
and program approaches and benefits or gains for different 
kinds of children in Head Start? 



OVERVIEW OF THE FINDINGS 

Head Star t children and families differed from one another on 
almost ev ery dimension studied . This finding is consistent with 
reports by Bates (1972;, Educational Testing Service (l97l), and 
others of the heterogeneity of family circumstances and child develop- 
ment within "poor" families. 

2* The family was the ma.jor determinant of initial performance 
differences among He ad Start children . Specifically, parents of 
children who scored highest at the initial testing had higher 
aspirations and expectations for the child's education, were more 
accessible and read more frequently to him/her, seldom used physical 
punishment in disciplining the child and generally felt more confident 
about life. 

3- Most Head Start pr ograms were oriented in a nonspecific wav 
to general personal-social development, but varied extensively from 
one another within tha t approach . This finding is consistent with the 
view of Head Start as a broadly-based and multi -aimed child develop- 
ment program as dilstinct from an intensive compensatory education 
program aimed at remediating cognitive deficits. It supports the 
position that Head Start is a multiplicity of piograms rather than a 
single project or educational treatment. 

^' Children gained from their Head Start experience; they gained 
more in specific ski lls and cognitive functioning (achievement^ 
motivati on, adjustment to the testing situation, academic achieve ment.! 

and less in basic co stive abilities and other aspects of sonl^l 

development. The interpretation of these findings remains somewhat 
problematic. In certain areas of children's social development, e.g., 
peer sociability, it is not surprising that negligible gains were 
reported since Head Start children were already performing well in 
this area, leaving little room for improvement. However, in those 
areas where ample room for improvement did exist, i.e., basic 
cognitive abilities and cognitive functioning, the gains varied from 
slight to substantial. Two interpretations are plausible: first 
that basic cognitive abilities are less easily modified by preschool 
interventions; or second, that the larger gains in motivation and 
achievement reflect the Head Start "whole child" program approach. 

^ ^ 5. Parents' attitudes to ward education became more favorabl e 
but parents' attitudes toward soc iety did not change significantly . 
Parents attitudes toward education were favorable initially and 
became even more favorable. However, no comparable improvements in 
parents feeling of social alienation were brought about by chlldren'R 
participation in Head Start. 
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6. On the whole t, child and family background characteristics 
were imrelated to magnitude of gains made during the program . Child's 
a^e and pretest IQ were the only characteristics consistently related 
to gains. Family characteristics, an important determinant of initial 
performance levels, did not appeax to be an important determinant of 
changes associated with the program. 

7. Teacher and program characteristics were the ma.jor determinants 
of magnitude of gsdns made in the program . All aspects of teacher and 
program inputs measured in these studies were related to gains in the 
areas of cognitive functioning, cognitive abilities and social adjust- 
ment. Generally speaking, cognitive gains were associated more with 
cognitive-oriented program emphases and classroom activities; social 
gains were associated more with sociaJ. -oriented program emphases and 
with qualitative aspects of classroom atmosphere. However, the data 

do not indicate a cleax division, within the "whole child" orienta- 
tion of most Head Start prog:rams, between specific combinations of 
program inputs and child development outcomes. 

8. These data suggest that there was no one "best approach" for 
all children within the range of program variations studied . Two 
broad program clusters appeared effective for two different kinds of 
Head Staxt children: one cluster was related to the orderliness and 
structure of the environment and was paxticulaxly beneficial to 
children who initially lagged behind their peers; a second cluster 
was related to the social climate of the class and was more beneficial 
"*-o children performing at higher levels upon entry into Head Start. 
T\\e finding that what a child brings to a program influences how he 

is aiffected bv it is consistent with esirlier evidence from studies by 
Weikart (1969; and Miller (l97l), and with the viewpoint articulated by 
the developmental theorist, J. McVicker Hunt (l96l) . These data are 
among the first large scale evaluations that report inter-relationshlps 
among program and chll.d characteristics and, as such, offer tentative 
support for the policy of individualizing preschool experiences in 
Project Head Ststrt. 
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HISTORY OF PROJECT HEAD START 



Head Staxt is a comprehensive child development program designed 
to give children of poverty the services and experiences they need to 
help them develop as healthy, happy and competent individuals. 
Initiated in the summer of 1965 as part of the War on Poverty, Head 
Start was seen as helping other adult-oriented (e.g., Job Corps), 
service-oriented (e,g.. Neighborhood Health Centers), or change- 
oriented (e.g.. Legal Services), programs break the poverty cycle 
by early intervention. Since summer 1965, Head start has served almost 
three million children in the eight-week summer programs and over one 
and one-half million children in the full-year programs. 

Chief among the distinguishing characteristics of Head Start as 
conceived by the committee chaired by Dr. Robert Cook were; 

.provision of comprehensive services with pairticular 
attention to health and nutrition 

.emphasis on the importance of strengthening family life, 
and the ability of the parents to be the primary advocates, 
change agents, and educators for their children 

.focus on the child's motivational and social development 
and on the achievement of competence in everyday life 
including academic preparation for school 

The report of the Cookj Committee (1965) reflects not only a 
belief in the value of early intervention but an awareness of the 
need for continuous comprehensive support if children of poverty are 
to reach their full potential. The Cooke Memorandum describes the 
seven major objectives of Project Head Start: 

1. Improving the child's physical health and abilities. 

2. Fostering the emotional and social development of the 
child by encouraging self-confidence, spontaneity, 
cioriosity, and self-discipline. 

3. Improving the child's mental processes and skills with 
particular attention to conceptual and verbal skills. 

4. Establishing patterns of expectations of success for the 
child that will create a climate of confidence for his 
future learning efforts. 

5. Increasing the child's capacity to relate positively to 
family members and others while strengthening the family's 
ability to relate positively to the child and his problems. 
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6. Developing in the child and his family a responsible 
attitude toward societyp and fostering constructive 
opportunities for society to work together with the poor 
in solving their problems. 

7. Increasing the sense of dignity and self -worth within the 
child and his family. 

Head Start is often thought of in the context of compensatory 
education; that is, as a remedy for specific cognitive deficits in 
the participating children. Another common viewpoint sees Head Start 
as a program of oaxly childhood education, designed to take advantage 
of the young child's rapid learning and development. A review of the 
Cooke Memorandum objectives, "^however, indicates that Head Start was 
planned as a broadly based program of esirly child development to deal 
comprehensively with the needs of young children in a context of 
community involvement and parent participation. 

Within the framework of national guidelines specifying goals, 
program components, minimum staffing patterns, and policies governing 
eligibility and program administration (Head Start, 196?), individual 
Head Start programs have varied considerably in their goal emphases 
and have used a broad range of approaches including structured, 
skill-oriented training and cultural programs focused on the child's 
ethnic heritage. Not all Head Starts have been equally successful 
in the task of balancing the developmental expectations of the local 
cultural milieu and the developmental expectations of the larger 
community as expressed in requirements for school achievement. While 
approaches to the assessment of Head Start success have increasingly 
reflected the breadth of Head Start's objectives, sissessment technology 
still lags behind new perspectives on cultural differences. 

The origins of Head Stsirt have been described in many reports 
(Hellmuth, 196?, 1970; Auleta, 1969; Anderson and Shane, 1971; 
Evans, 1971; Steams, 1971; Braun and Edwards, 1972; Rowe, et.al., 
1972). To summarize briefly, the inspiration for Head Start sprang 
from the following sources: 

.Growing public and governmental support for efforts to 
reduce the obvious racial and economic inequalities 
in all parts of the country. This general concern 
led to the creation in 1964 of the Office of Economic 
Opportunity and to the declaration of the "War on 
Poverty . " 



•Increased recognition that educational inequalities are a 
major factor in creating and perpetuating inequalities 
in earning power and the general quality of li>d.ng. 1/ 
One example of this recognition was the 1954 United States 
Supreme Court ruling on racial desegregation of the schools, 
In the ensuing efforts to redress racial inequalities, it 
soon became evident that many of the inequalities were a 
function of socio-economic differences, Attention then began 
to focus on how compensatory education for children from 
low-income families might be used to help break the poverty 
cycle, 

,An accumulation of theory and evidence from early studies by 
the Iowa Child Welfare Group (Skeels and Dye, 1939) and 
from work by Hunt (l96l) and Bloom (l964) showing that 
environmental factors in the early childhood years are 
particularly powerful in shaping children's future growth 
and development. 

.Recognition of the disproportionate nximber of poor children 
with school problems and low achievement, and the hope that 
eaorly education might reduce their incidence. 



1/ The authors of the recently published study, Inequalityi A 
Reassessment of the Effects of Family and Schooling in America 
(Jencks, et,al., 1972), argue that schools have little long- 
term effect on the socio-economic success of the individuals 
who attend them. From his reanalysis of the Coleman Report 
data, Project Talent's Longitudinal Study, and Census Bureau 
data on adult occupational status and income, Jencks concludes 
that differences or inequalities in educational opportunities 
and schooling are unrelated to socio-economic differences 
among graduates and draws from this correlational evidence 
the policy implication that school reform is important 
for improving the quality of the lives of children but 
contributes only marginally to reducing adult inequality 
and poverty. 



12 



.Reseaxch findings indicating that disadvantaged children 
are deprived of many of the opportunities which seem 
necessaxy for growth during the early years, 2/ 

In an attempt to take a first step towards ameliorating the 
negative effects of poverty on children, Project Head Start was 
launched as an eight-week pilot program in Spring, 1965. When Head 
Start attained instant popularity, enrollments exploded from ^,000 
children in Spring, 196^, to over 500,000 children in Summer, 1965. 
Since it was recognized eaxly (Hunt, 1966) that a longer program 
might "be required to produce a more enduring and significant impact 
on children, families, and institutions, several Head Start programs 
were converted from eight-week pilots to full-yeax projects in 
August, 1965* However, few full-yeax programs actually became 
operational until 1966-1967 because of the problems involved in 
"starting-up*' such an effort, e.g., locating suitable sites and staff. 

In 1967-1968, almost JOO new full-yeax programs were f\mded aind 
some experimental Head Staxt-related projects were initiated: 
Parent-Child Centers to provide services to toddlers (see Costello 
and Binstock, 1970), and Project Follow Through to continue providing 
services to Head Start graduates (see Bissel, 197l). Thus, in the 
years I966-1968 the program had been initiated and rapidly expanded 
in all directions to serve increasing numbers of disadvantaged pre- 
school age children, and some younger and older disadvantaged children 
and their families as well. 



17 Since then, the existence of critical periods (i.e., times of 
maximum capacity for growth or susceptibility to irreversible 
damage), and the role of the preschool years as a critical 
period, have been challenged (Rohwer, 1970). Similarly, the 
relative value of child-focused (as contrasted to adult-focused) 
interventions has been questioned, based on the apparent 
failures of preschool, primary, and secondary age interventions 
(Averich and Donaldson, 1971; Hosteller and Stet, 1972). These 
recent reports do not deny the influence of the environment nor 
the importance of growth during the first six years of life; 
instead, they examine the relative costs and long-term benefits 
of different intervention approaches. In 1961-1965, when many 
of the first of these programs were initiated, however, 
intervention during the formative years was a logically clear 
majidate from the available data. 
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The 1968-1969 program year was the first during which Project 
Head Start could be considered to have achieved stable operations. 
This important historical fact provides some rationale for basing 
this summary report of I966-1969 evaluation studies almost 
exclusively on the 1968 data. 

It is also noteworthy that Head Start centers were not alone 
in their start-up year management problems. The Head Start regional 
Evaluation and Research Centers, scheduled to begin collecting 
national evaluation data in 1966, did not begin operations until 
September 1966 - January 1967. Since full -year Head Start classes 
began operation well before the fourteen Evaluation and Research 
Centers were ready, the data from 1966-1967 only approximates 
a "pre" and "post" study with an average interval between first and 
second testing of only four months. Fortunately, these start-up 
problems were minimized in the second and third evaluation years 
(1967 and 1968). The Evaluation and Research Centers continued 
operations until 1969, when another Head Start related project. 
Planned Variation (see Klein and Datta, 1972) was begun and the 
Centers were phased out. 

These considerations, while rarely recognized in discussions of 
Head Start, are nevertheless significant in interpreting evaluation 
data from Head Start and similar programs sis critical tests of the 
validity of conceptual positions on the effects of early intervention 
on child development. 

It is now recognized (Bissell, 197l) that even well-planned, 
conceptually integrated, and highly focused programs require at 
least three years to implement on a modest scale; this suggests that 
while findings from the 1966-1969 evaluations of Head Start and 
similar studies can have substantial value for identifying likely 
immediate and long-range effects, they cannot individually, or as an 
average, 2/ be interpreted as conclusive or critical tests of the 
validity of Head Start as a conceptually based solution to a social 
problem. 



^ See Light and Smith (l97l) for a discussion of issues in 
averaging results from several studies. 
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OVERVIEW OF HEAD START EVALUATIONS 



Evaluation Issues , Initially, two issues received considerable 
attention in Head Start evaluations, including evaluations funded by 
Project Head Start as well as those undertaken by other agencies and 
individual resesurchers: (a) Is the program delivering the services 
it is obligated to deliver to eligible children and families? 
(b) Is the program of benefit, as expected, to children and their 
families? 

The first issue focuses on the process of implementing a program 
and traces the stages of program development from the identification 
of a social need to the actual delivery of services and program impact. 
Addressing this issue enables researchers and prograrr;. planners to 
identify where breaJcdowns in the process occur and to determine 
whether a program has been ineffective or merely improperly implemented. 

The second issue focuses on outcome. Most outcome sissessments 
do not distinguish between levels of program implementation; undesir- 
able or negligible outcomes are often misinterpreted as indicators 
of program ineffectiveness rather than failure of program implementa- 
tion. Unfortunately, in large scale evaluations which incliide many 
programs of varying quality, program implementation becomes a matter 
of degree, or a vaoriable, that is usually not adequately controlled. 

In addition to these broad issues concerning program effective- 
ness, three issues of a more substantive nature have gadned in 
popularity since the beginning of Head Start and parallel the growing 
interest in the field of child development and early education. These 
issues are: (a) Does the success or failure of Head Start, when 
viewed solely as a compensatory education program, mean that compen- 
satory education has succeeded or failed? (b; How durable are the 
benefits of early versus later school interventions? (c) What 
program, ciirriculum model, or approach works best? 

Viewed in the context of these evaluation issues, the Head Start 
evaluation studies conducted in 1966-1969 and summaxized in this 
report can be described primarily as outcome assessments of different 
program approaches within Head Start to determine which kinds of 
prograims work best; the degrees of program implementation was roughly 
ascertained on the basis of program compliance with Head Start guide- 
lines. These studies were not designed to address the issues of the 
success of compensatory education or the durability of its effects; 
no control group data were collected and only immediate changes were 
measured. 

Evaluation Studies . From the perspective of the type of research 
study conducted, Head Start evaluations can be grouped as follows: 
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1. descriptive studies of program compliance with Head 
Start guidelines; 



2. one-site research and evaluation studies of individual 
Head. Start programs; 

3- national impact st\«iies of the overall effects of the 
Head Start program >riiere program characteristics are 
not linked to outcomes; and 

4. national interactive model studies of specific program 
characteristics as they are related to outcomes. 

!• Descripti ve Studies . Between summer 1965 and summer 1970, 
at the request of Head Start, surveys of random samples of Head Start 
programs, centers, classes, staff, and children were conducted by the 
Bureau of the Census. The questionnaires were designed by Head Start 
evaluation staff and program specialists to assess compliance with 
national guidelines and to describe Head Start programs. A serLps 
of reports prepared by Bates (1969, 1970, 1972), Office of Child 
Development, provide an extensive description of each summer and 
full-year program since 1965. In general, the findings indicate 
compliance with Head Start guidelines-, but also detail substantial 
variability along child, family, sfe-.f, rind program characteristics. 

The Census surveys, in tracing the development of various program 
components over the history of Head Start, show that while most 
programs have been in compliance with guidelines, program quality 
has been (a) uneven between program administration and services 
components, and (b) uneven from project and even from class to class. 
Variations seem, predictably, to be greater in the early programs 
than in more recent years; trend analyses have shown steady progress 
in many areas of significance to Head Start, such as provision of 
extensive in-service training and election of parents to Policv 
Councils . *' 

2* One-site Research a nd Evaluation Studies . Many of these 
single-project studies were funded by Head Start to assess the 
immediate effects of program participation on child development and, 
in some cases, to follow the children after they entered public 
school. z/ ^ 



57 Most of these reports are available through the ERIC Early 
Childhood Education Clearlnghoiise, University of Illinois, 
Urbana, Illinois 
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It should be noted that the research designs for Head Start 
evaluations typically are similar in methodological features to the 
research studies of experimental preschool programs, few of which 
permit drawing unamliguous inferences regarding program effects. 
Nonetheless, some of these single project ^vailuations do involve control 
groups of non-participating children, and many use pre and post 
measurements . ^ 

In general, most inteirvention programs have been found to produce 
a statistically significant, immediate effect on cognitive and 
lingxii.stic performance. These gains have been greatest when (a) the 
program was directed toward specific educational objectives, as in 
many "experimental" Head Start programs, ctnd (b) the children 
participated for a nine-month school year rather than a three-month 
summer session. 

Compared to the substantial documentation of cognitive and 
linguistic gains, data pertaining to the effects of early interven- 
tion on sociauL-emotional development are sparse. This is all the 
more regrettable since "increasing the self -concept" , for exeimple, 
is a nearly universal objective of Head Start programs. One major 
problem is that although there are no entirely satisfactory^ measures 
of cognitive, perceptual-motor, linguistic, and academic skills for 
preschoolers there are even fewer psychometrically acceptable measures 
of personal and social growth (see Hoefner, et.al., 1971; also see 
Walker, 1974). However, what evidence exists is positive (see 
Grotberg, 1969); there are numerous observational and anecdotal 
accounts of the effects of preschool interVehtions on improving 
self-confidence, peer relationships, interactions with adults, 
curiosity, and other personal-social characteristics of disadvantaged 
preschool age children. 

With due regard for methodological and other limit to present 
knowledge, most researchers conclude that immediate changes in the 
development of low-inome preschool childxen are attributable to pre- 
school intervention programs, outcomes which are within the current 
state-of-the-art for small scale programs and are theoretically 
possible in laxger scale prograjns. 



^7 The findings from the Head Start one-site research stiadies 

have been reviewed at length by Grotberg (l969), Datta (1969)1 
Steams (l97l)» Butler (l970), and others. 
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3' National Impact Studies . Only one national Impact study has 
been reported by Head Start where the objective has been to assess 
performance changes without assessing the specific conditions under 
which such changes occur. Known as the Westlnghouse Report 
(Clcarelll, et.al., 1969), this study was funded by the OEO Division 
of Research, Program Planning, and Evaluation In Summer, 1968. The 
design compared the performance of Head Start and non-Head Start 
children in 10^ sites who were attending first, second, and third grades 
in October-November, 1968. About two-thirds of the Head Start children 
had previously attended summer programs and one-third had previously 
attended full-year programs. Although aspects of the study are 
considered methodologically controversial (Smith and Blssell, 197O; 
Campbell and Erlenbacher, 1970), the findings reported in the Westing- 
house Report are similar to those of one-site studies discussed earlier; 
children tested in first grade soon after leaving full-year Head Start 
achieved higher scores on cognitive measures than did non-Head Start 
controls; however, by second or third grade Head Start children on 
these cognitive measures. Moreover, the Westlnghouse Report found no 
benefits, either cognitive or motivational, associated with summer 
attendance in Head. Start. 

While the findings of the Westlnghouse Report were similar to 
those of previously described one-site studies, the conclusions drawn 
were not. Most researchers interpreting similar findings concluded 
that Head Start programs should be extended through primary school, 
reasoning that continued intervention would help to sustain the effects. 
Clcarelll, et.al., concluded however, that most Head Start programs did 
not produce the expected outcomes and recommended curtailing the 
projects and modifying the full-year Head Start program by installing 
known successful preschool education models. An experimental approach 
to Head Start was strongly urged, with further program expansion to 
serve more children suspended until proof of the effectiveness of these 
changes became available (note that Head Start serves only approximately 
11% of children eligible as defined by the age and poverty criteria). 

, , ^' National Interaction M odel Studies . Most national evaluations 
funded by Head Start have been based on an interactive assumption: 
the belief that different programs have different effects on different 
children. Prompted in part by Hunt's description of the Importance of 
the proper "match" between the child's competencies and the program's 
challenges, and in part by the diversity of Head Start children which 
seemed too great to be quality well served by one approach, this belief 
has been shared by such researchers as DlLorenzo and Salter (1969) . 
Karnes (l969), and Miller (l97l) who have investigated the different 
effects of several well-known curricula. 

Head Start national evaluations have Includeds a review of the 
summer 1965 reports (Planning Research Corporation, 1966); an analysis 
of the associations of the first full year (1966) program experience 



13 



(Plctnning Research Corporation, 196?); an evaluation of the Summer, 
1966, Head Start programs (Boyd, I966); a study of the effects of 
Head Start on community institutions (Kirschner Associates, Klein and 
Datta, 1971); and the Educational Testing Service Longitudinal Stiody 
of Disadvantaged Children and their First School Experiences 
(Shipman, 1970). B'rom I966 to 1969f the national evaluation program 
was undertaken "by the 1^ Head Start Evaluation and Research Centers, 
(see Table l) . It should be noted that in all of these studies 
assessments of program and teacher characteristics and of classroom 
process variables have received as much attention as assessment of 
initial and final levels of performance. Analyses of these evaluations , 
including the I966-I969 studies, .have been directed at identifying the 
conditions associated with greatest p:ain for different children . 

Yet, despite the widespread belief in the interactive assumption 
and commitment to investigating its validity for preschool interven- 
tion, there has been to date relatively little direct evidence from 
Head Start data showing that child, family .or even program characteris- 
tics are differentially associated with the magnitude of gains made in 
the program. While in every study some differential effects and 
interactions have been reported, their size and relative infrequency 
is more striking than their presence. But these earlier data in 
particular are fraught with methodological problems which likely 
obscurred interactions, e.g., lack of interactions can be attributed 
to the relatively restricted range of classroom characteristics in 
the 1965 and 1966 programs, and/or to the restricted range 01 outcome 
variables (nearly all of them academically oriented) which were studied. 

As a comprehensive child development prograim, of which clsLSsroom 
experience is but one key component. Head Start has been influenced 
by both educational and developmental aspects of research on the 
effects of early intervention. The nature of the Head Start program — 
broadly child development oriented in its objectives, and, in the ccLse 
of the classroom component, more narrowly education-oriented — siiggests 
that at all decision making levels the interplay or interaction between 
clajssroom/teacher and child characteristics must be taken into account. 

The need to assess interactions as sensitively as i)ossible has 
been reflected implicitly in the evolution of Head Start national 
evaluations from the simple random sample designs of Summer 1966 to the 
planned variations and interventions of I969, 1970, and 1971. The 
evaluation studies siammarized in the present report represent an 
intermediate stage in this evolution. They do not provide the most 
direct and sensitive test of program-child interactions; nor do they,, 
with the exception of the 1968-I969 data, sample a wide range of 
program variations. Nonetheless, the 1966-I969 evaluation studies 
were directed, to as great an extent as possible, to what was 
identified as one of the central issues in early childhood interven- 
tion: What programs work best and for what kinds of children . 



19 



TABLE 1 

HEAD START: NATIONAL RESEARCH AND EVALUATION STUDIES, 1965 - 1972 



YEAR 


PROJECT 


Summer 1965 


Local reseai*ch studies; PRC report; 
Census Study 


Summer 1966 

Full Year 1966-1967 


ETS national evaluation, Census survey 
c»ocx4 uenuer suuQy; L#ensus survey; 
PRC report 


Summer 196? 

Full Year 1967-1968 


Local research studies; Census survey 
E<SbR Center study; Census survey 


Summer 1968 

Full Year I968-1969 


Local reseaxch; Census survey 
E&R Center study; Census survey j ETS 
longitudinal study, Yecir 1; 
(Westinghouse report, Spring 1969) 


Slimmer 1969 

Full Year 1969-197O 


oensus survey 

Planned Variation, Yeair 1; ETS 
longitudinal study. Year 2| Census 
survey; community impact study 


Summer 1970 

Full Year 197O-I971 


Census sur*/ey 

Planned Variation, Yeax 2; ETS 
longitudinal study. Year 3; Census 
survey 


Summer 1971 

Full Year 1971-1972 


Health Start, Year 1 
Planned Variation, Year 3; Home Start, 
Year 1; ETS longitudinal study. Year 4j 
parent participation study; staff 
mobility study . 


Summer 1972 

Full Year 1972-1973 


Health Start, Year 2 

Home Start, Year 2; ETS longitudinal 

study. Year 5; others in process 
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DESIGN AND MEASUREMENT 



METHODOLOGICAL ISSUES 

Evaluation methodology has changed rapidly since 1965 when the 
first Head Start programs were founded. So rapid are thase changes 
that in the two or three year time span between initiatirg a study 
and preparing a final report, the research design may have become 
obsolete. 

Exajnined from this evolving methodological perspective, Head 
Start program evaluations designed in 1970-1973 doce likely to differ 
from those designed in earlier yeairs in three major respects: 

. more realistic estimates of the length of time required to 
implement a new pro-am or program objective . Recent 
evaluations of new Head Start programs such as Health Start 
(Urban Institute; final report completed 1973) and Home Start 
(High Scope Foundation; final report due 1975) focus the first 
year evaluation activity on assessing management and administra- 
tion including staff selection and training and then, during 
the second or third year of the program, stssess impact on the 
target population. 

. more closely linked to program ob.iectives . Today's evaluators 
work directly with program planners and managers to identify 
appropriate criteria of program success and realistic dates 
when short-term aJid long-term effects can be expected. Today's 
evaluators are more conscious of the need for developing a 
repertoire of sensitive and reliable outcome measures or at 
least alerting program planners when no satisfactory outcome 
measures are available to assess specific program objectives. 
Recent evaluations are thus more likely to gather data by 
direct measurement of sp3cific behaviors that are expected 
to be influenced by the program and less likely to draw upon 
data from indirect measures of general traiits or abilities. 

. more rigorously experimental in design . The limitations of 
statistical manipulations or aata to compensate for playing 

fast and loose with the design have become more apparent in 

recent years. The problem is not primarily one of assessing 

change; assessing change is a relatively straightforward 

matter of comparing initial with later measurements. The 

problem is one of ascribing change unambiguously to the program 

intervention or ascribing relative change to different programs. 

In order to conclude that an outcome actually resulted from the 

program Input, evaluation theorists argue that certain niinimal 

design criteria miist be upheld, even at the expense of scaling- 

down national studies. 
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The studies summarized in. this report were designed with some, 
but not perfect, foresight into these developments. Among the 
limitations are: 

1. Lack of a control group of children who have not attended 
Head Start. 

2. Non-random assignment of children to different program 
types within Head Start. 

3. Non-uniformity of time and conditions of data collection. 
On the other hand, the advantages of these data include: 

1. Extensive descriptions of what actually happened in the 
classrooms . 

2. Data on child, family, and program characteristics for 
large national samples for three consecutive years. 

3. Assessment of personal and social as well as academic and 
cognitive development. 

^. Measurement at two points in time — roughly the beginning 
and end of the program. 



DESIGN ISSUES 



There are two design issues addressed in the I966-1969 evaluation 
studies which are especially significant for early childhood interven- 
tion research: the measurement of social-emotional development and 
the measurement of program-child interactions. 

Measuring social-emo tional development . Large-scale studies of 
child development programs require tests which are easily administered 
and scored, with high statistical reliability and validity. 
Unfortunately, there are few measures of young children's social- 
emotional development which meet these criteria. The best data in 
these non-cognitive areas come not from standardized tests, but from 
controlled time-sampling of children's behavior and clinical observa- 
tions of children. For example, it has been argued convincinfrlv bv 
Zigler and Butterfield (1968) that changes in personal-social 
attitudes rather than changes in actual cognitive abilities account for 
the small increases in IQ test scores produced by early intervention 
programs. Zigler and Butterfield maintain that social and motivational 
factors can be readily influenced by early childhood interventions and 
that these changes in turn improve performance in the testing situation. 
More recent studies by Emmerich (l97l) and Dittman, et.al. (1971) using 
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observational techniques, lend support to this argument "by reiXDrting 
substantial socialization gsdns for Head Start children during the 
first half of the program. 

A common misconception is that Head Start evaluations and other 
research studies have neglected to collect personal-social data or to 
develop new measures. If anything, however, more has been invested 
in this area and in studies of ongoi.ng classroom activities thstn in the 
measurement of cognitive and linguistic changes. With respect to 
national evaluations conducted in 1966-1969, a variety of rating scales 
of social-personal development were prepared. These included not only 
a vaxlety of children's behavior rating-scales to be used by teachers 
and testers, but experimental tests of achievement motivation and 
extensive observations of children's unstructured social interactions 
with older children and adults. The data have proved expensive to 
plan for, to collect and regrettably, to analyze. This summary report, 
in addition to the final reports prepared by System Development 
Corporation and Research Triangle Institute 6/ represent analyses of 
only a fraction of the total data banks resulting from the 1966-1969 
evaluations. Subsequent analyses of these data are likely to prove a 
fruitful source of new knowledge in the area of personal-social develop- 
ment. 

Mea^urinK interactions between child and program characteristics . 
The 1966-1969 evsuLuations were desienied to test the "interactive 
hypothesis" or interactive approach to eaxly child development 
programs J that some kinds of program experience may be better than 
others for certain children and that what is the best program for one 
child may not be the best for another. The most direct test of the 
interactive hypothesis would req^uire a comparison between individualized 
programs (where children's needs/competencies are matched to curriculum) 
and undifferentiated programs (where all children receive the same 
curriculum) . If the interactive hypothesis were substantiated, more 
children would be expected to progress in individualized programs than 
in the undifferentiated programs. 

The 1966-1969 evaluations provide an indirect test of the inter- 
active hypothesis; children were not matched to curriculum. However, 
the design did provide a systematic search for (a) overall changes, 

(b) program characteristics differentially eissociated with changes, 

(c) child characteristics differentially associated with changes, and 

(d) patterns of child/family and teacher/program characteristics 
differentially associated with changes. 

6/ Final reports prepared by System Development Corporation and 

?n^r''n5.'^^^^ Institute on the evaluation of Head Start from 
1966-1969 are filed in ERIC/ECE (Early Childhood Education 
Clearinghouse, in Urbana, Illinois). 



23 



DATA COLLECTION 



The evaluation data were collected in 1966-1967, 1967-1968, and 
1968-1969 by m. regional university-based Evaluation and Research 
Centers. 7/ The evaluation designs and types of measures developed 
during the three years of evaluations are described below and are 
suim-narized in Table 2 (see also Datta, 1969). 

Evaluation data for iq6^-iQ^7 Head Start classes were selected 
for study by the Centers to provide a representative sample of Head 
btart programs. Programs were selected for variability on the follow- 
ing characteristics: type of delegated agency operating the Head Start 
program (Community Action Agency or Local Educational Agency) , geograph- 
^^,3°°^^^°". ^r^^ °^ urban); ethnic backgound of the child (Black, 
White. Spanish-sumamed, Polynesian). The Evaluation and Research 
Centers studied as many Head Start classes as possible within the con- 
fines of their geographic areas, drawing samples of six to eight chil- 
dren from each class. 

^ Evaluation data for I967-I968. Head Start classes were selected 
for study by the Centers to provide a more differentiated sample of Head 
Start classroom activity, but not necessarily a samnle that was rep- 

fic^tion\^J S^lo^^"". The purpose of this sampling modi- 

lication waste provide a greater opportunity to assess differential 

K ! ^ classroom, i.e., interactions between program and child 

characteristics. This more differentiated sample of classSom Sti^ty 

on%hp^^^ f "^^^ selected predominantly 

on the basis of classroom style and process characteristics, although 
data were also gathered on those characteristics identified in the 1966 
evaluation design; and second, all children within each class were in- 
cluded in the sample .2/ ^xaoo ±n 



^ Evaluation and Research Centers were in operation from 

1966-1969. They were established by the Head Start National Office 
Hp^h ^+ ^TfT- °^',^«^«l°Pi"S and selecting, with guidance from the 

iQ^ National Office staff, the designs and measures for the 
1966-1969 evaluation studies. Data were then collected and analyzed 
S ! J^xJ^r.!^'^^'' direction of the national office. It should 
be noted that the beginning of full -year Head Start classes in 1966 
Rii"°\°n^''i^P completely with the operation of the Evaluation and 
Research Centers. As a result, the 1966-1967 data do not provide 
f^??;^l ft- assessment- the average interval between 

initial and final testing was only four months and some children were 
enrolled as long as seven months before they were first tested. 

8/ In the 1967-1968 evaluation study, extensive data were collected on 
classroom processes, social-motivation and perceptual-cognitive devel 
opment j-uch of these data remain to be aSlyid and a^e rJ^erred 
to collectively as "cluster data", indicating how these particular 
data were collected by four' clusters of Evaluation and Research Cen- 

OCX'S • 
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TABI£ 2 

HE5AD START NATIONAL EVALUATION DESIGNS AND MEASURES 1966, 196? and 1968 



1966 



1967 



1968 



Design 

N Classes studied 

N children tested 

Basis for selecting 
classes 



225 
1806 



177 
1889 



148 
1989 



variability variability variability 

by categories by classroom by experl- 

of program style and mental inter- 

chlld process ventions 

characteris- characteris- 



tics 



tics 



Selection of children random all children no prior 

within in each K/S 

clciss class experience 

Common Variables Across Years 



Classroom activity 


observer 
form (ORF) 

M 


observa- 
tions of 
clcLssroom 
activity 
(OSCI I) 


observa- 
tions of 
classroom 
activity 
(OSCI II); 
rating of 
teacher 
behavior 
(POT), 
teacher 
question- 
naire 


Teacher characteristics 


question- 
naire 


question- 
naire 


question- 
naire 


Class resoiirces aind 
facilities 


observer 
form 


observer 
form ' 


observer 
form 


Family background 


interview 


inteirvlew 


interview 


Admlnls tration 






checklist 


Cognitive performance 


IQ test 
(Binet) 


IQ test 
(Binet) 


IQ test 
(Binet) 
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TABIB 2 (cont.) 



Achievement 



Motivation 



Social Adjustment 



Cognitive Style 

Data Collection 

Mean week of Pretest 
Mean week of Post- test 
Mean inteirval of weeks 

% with prior Head Start 



test 

instrument 
(PSI) 

teacher 
rating 



tester 
rating 



1968 



12 
28 
16 



tester 
rating 



observation 
of 

children's 
social 
interactions 
(SIO) 



test 

instrument 
(PSI) 

tester 
rating; 
experi- 
mental 
measiire 
(GUMP) 

sociometric 
instrument 



15% 



rating 
instrument 
(Hertzig- 
Birch) 



7 

32 
25 
29% 
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Evaliiatlon data for 1968-1969* Mauiy Evaluation and Research Center 
directors madntained that sampling on the basis of the "natural varia- 
tion" in classroom activity, as in the 196? study, included too few sites 
that were examplars of new approaches in early childhood education. 
Moreover, many directors pointed out that the degree of program effective 
ness, in terms of child development outcomes, was not uniformly high 
across aill Head Start programs. Thus in 1968, each Evaluation and 
Research Center director developed a reseaxch design which compared 
"regular" Head Start classes (reflecting an experimental and general 
approach) with "intervention classes" (reflecting an experimental 
approach or special educational emphasis). The "interventions" 
undertaken were diverse and included projects emphasizing lajiguage, 
basic cognitive processes, parent education, community participation 
in the setting of prograim objectives and physical facilities. The 
psarticular types of intervention inclined in the 1968 evaluation study 
axe summarized in Table 3« It was reasoned that an evaluation design 
comparing "regular" with "intervention" Head Start classes would permit 
drawing clearer inferences about differential program effects and program 
child interactions, at the expense of a less programmatically-represen- 
tative sainple of Head Start classes. In the 1968 study there were no 
additional sampling reqiolrements other than selecting children with no 
more than four months of prior Heaxi Start experience and maintaining 
the geographic distribution which was the natural result of the location 
of the Centers. Most Center directors chose sites relatively near their 
universities which would maintain the desired balance of rural and urban 
settings. 

MSASURES 

In every prograin evaluation to date, an attempt has been made to 
collect data relevant to the several Head Start objectives; the attempt 
has been more successful with parent and class characteristics than 
with child characteristics. It has proved difficult to find or construct 
reliable, easily arbninisteared, readily interpretable measures which are 
aiso psychologically meaningful for different age and cultural groups. 
In many studies, the evaluator must choose between investing in (a) 
untried measures that look meaningful but may prove to be costly, in- 
sensitive, or unreliable, and (b) standardized measures that are in- 
appropriate developmen tally or culturally, and may prove to be loninter- 
;ret?,ble or misleading because the usual interpretation is invalid for 
the Head Start population. The compromise of the 1966-1969 evaluation 
studies was to use both new and standard instruments, and to interpret 
with care. 

For purposes of this summaury report, the data from the 1966-196? 
and 1967-1968 evaluation studies will be used selectively only to 

ndicate replication of important findings obtained in the 1968-I969 data 
..r to highlight a theoretically meaningful point. Detailed descilptions 
of the variables and measures used in these evaluation studies are 
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TABLE 3 

EVALUATION AND RESEARCH CENTER EVALUATION DESIGNS FOR I968 - I969 



Cente 



N ClEisseS' 
R I 



:2/ 



Objective of Intervention or Study 



Boston 
Syraciise 
Bank Street 

Temple 

South Carolina 

Tiaane 

Southern 

Texas 

Kansas 

Michigan 

UCLA 

Hawaii 



Ik 
8 

6 
2 
2 
1 
2 



8 

8 

6 
12 
10 
11 
11 

7 

9 
10 

8 



Effects of community participation 

Teacher praise/blame behavior 

Comparison of Bank Street and 
"structured" HS 

Effects of gireatly enriched materials 
Language development program^ 
Language development program^ 
Language/parent education^ 
Language development program^ 
Behavior modification training 
Piagetian training-cognition 
Teacher/parent goal setting 
Languaige/parent education^ 



1/ The University of Chicago (Dr. V. Shipman) and Teachers' College 
(Dr. R. Thomdike) participated in the I966-I967 and 1967-I968 
studies only. 

2/ Tulane, South Carolina and Texas had the same design and program; 
Hawaii/Southem also heui the same design and program. 

2/ Intervention (l) Head Start classes receiving experimental interven- 
tion programs were compared with Regular (r) Head Start classes 
receiving no experimental intervention. 
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sumraaxized in Table 2 and Appendix A, Since this summary report is 
based predominetntly on the 1968-I969 data, the characteristics 
measured in I968-I969 are described in the following section and explained 
more thoroughly in Appendix A, 

Characteristics a nd measures used in the 1968-1969 evaluation study . 
The 1968-1969 data inclxode information on children and families as well 
as teachers and programs. The data were collected using a total of 24 
forms, questionnaires, or test instruments. Additional demographic 
information (e.g., age, sex, urbanicity, ethnic background) was already 
available for all children on a master data card. 

1, Cognitive Development Characteristics 

Intelligence . The Stanford-Binet .IQ test, a commercially 
available, stajidardized, and widely used instrument, was 
employed as a measure of general intelligence. Scores 
were interpreted as reflecting a combination of experience, 
learning ability, cUid motivation. As a measure of performance, 
and not necessaxily an accurate measure of underlying capacity, 
Stanford-Binet scores have been found to be sensitive to a 
variety of educational interventions and predictive of later 
school achievement. 

Ability to learn a new task . In 1968-I969 this variable was 
assessed by the subscale of the Wechsler Preschool and 
Primary Scale of Intelligence (WPPSi), The subscale, entitled 
Animal House, is a relatively culture-free and non-verbal 
measure of this ajspect of cognitive functioning. 

Achievement and school readiness . One aspect of cognitive 
functioning which is not tapped by standardized IQ tests such 
as the Stanford-Binet is cognitive functioning at the more 
concrete level. The Caldwell-Soule Preschool Inventory was 
used as a measure of achievement and school readiness. It was 
designed to be especially sensitive to the cognitive changes 
expected from preschool intervention programs, 

2, Social -Emotional and Motivational Characteristics 

Ad.justment to a novel (testing) situation . When a standard 
ized test such as the Stanford-Binet is administered to a child, 
the child's behavior during the testing session, independent 
of his performance on the test, can be considered an important 
index of certain aspects of child development. The Inventory 
of Factors Affecting the Stanford-Binet is a rating instru- 
ment that was used by examiners to assess children's behavior 
during the test. The ratings are interpreted as reflecting 
traits such as impulsivity, concentration, and activity levels. 
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Cognitive style . Cognitive style is yet another aspect of 
child development that can be inferred from the child's 
behavior during the Stanford-Binet testing session. The 
Hertzig-Birch Response Style instrument was designed and 
used to measure various cognitive styles that the child 
employs when responding to test items. For example, a 
distinction is made between an incorrect response where the 
child has "worked at" a problem solution and an incorrect 
response where he has not. 

Achievement motivation . Achievement motivation, an impor-tant 
component of actual shool achievement, was assessed by means 
of an experimental measure (Gumpgookies) that required the 
child to choose between two pillow-like creatures, one of 
whom is engaged in achievement oriented behaviors and one 
of whom is not. 

Children's sociability . Children's relative popularity or 
isolation in the classroom, as rated by other children, was 
assessed by means of a Sociometric Play Situation instrument. 
Because of difficulties encountered in scoring the test, 
only social isolation scores were included in the data 
analysis , 

3. Parent and Family Characteristics 

Socio-economic status, parental attitudes, styles of parent - 
child in teractions, and household routines , A structured 
Parent Interview with parents (usually the mother) was given 
at the beginning and at the end of the program for the 
purpose of collecting information on these characteristics, 
^. Teacher and Program Characteristics 

Data on teacher and program characteristics came from two 
types of measures: self-reports by the teachers and 
objective classroom observations by trained observers. 

Characteristics measured by means of self-reports included: 

Program emphasis. Roals. and teaching approach . Teachers 
provided information on these characteristics in a 
structured Teacher Interview, 

Administrative variables. Teachers provided information on a 
host of administrative related variables by completing a 
Checklist of Administrative Variables. 

Te^ her's backgrjund. Each teacher completed a questionnaire 
(C .racteristics of Teaching Staff Form) which requested 
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demographic, educational and experiential background 
information about the teacher and teacher aides in each 
claissroom. 

Program facilities and resources . Teachers provided 
information about the materials available in the Head Start 
centers, along with details of the daily schedule and yearly 
calendar, on an instrument called the Class Facilities and 
Resources Inventory, 

Characteristics measured by means of observational techniques 
included: 

Teacher's behavior in the classroom . A trained observer rated 
each teacher on 33 five-point scales (Post Observation 
Teacher Rating Scale) reflecting behaviors such as reliance 
on on-going classroom activities. 

Teacher's and children's behaviors in the classroom . An 
observational instrument (Observation of Substantive 
Curriculax Input) consisting of a complex time schedule was 
used by trained observers to record the behavior of both 
children and the teachers in the Head Start claissroom. Among 
the most important codes were those reflecting the context 
and context of the children's activity, the form of teacher 
control used, and the extent of teacher involvement in the 
activity. 2/ 

STATISTICAL ANALYTIC MODELS 

Two contractors (Research Triajigle Institute, RTI, and System 
Development Corporation, SDC) were selected for the purpose of 
analyzing these evaluation data; both were provided with identical 
data files. The contractors were instructed, however, to make inde- 
pendent decisions about the selection of variables and choice of 
analytic techniques. As would be expected, the contractors differed in 
their decisions. A complete description of these differences is 
presented in Appendix B, but two important differences in their 
approaches axe worthy of mention here. 

First, although both contractors used a two-stage process for 
identifying and selecting a workable set of variables to include in the 
statistical ajialyses, the sequence of the stages differed. Research 
Triangle Institute selected variables first on a rational basis and then 

2/ Factor analyses of these data identified six teacher and classroom 
vajriables to be utilized in the subsequent statistical analyses. 
These variables are identified and briefly described in Appendix 
A. 
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on a statistical basisj 90 variables that appeared relevant to outcomes 
were selected and then screened for psychometric shortcomings (missing 
data, low variability, low internal consistency and high redundancy). 
System Development Corporation, on the other hand, selected variables 
first using a statistical criterion and then a rational criterion: 
203 variables were selected on the basis of changes in correlation 
with pre and post scores on perfonnance variables; then other variables 
were added which appeared to relate, hypothetically , to the questions 
addressed in the study. 

It should be noted that RTI and SDC used similar methods to assess 
the influence of child, family, and program characteristics on 
performance measures - i.e., " ^i^ scores", the difference between pre 
and post measurement, adjusted for pre score level. However, the two 
contractors adopted very different methods for examining the interactions 
between child/family and program characteristics. 

RTI analyzed gain scores in the context of several child and 
program factors simultaneously. For example, RTI analyzed Stanfoid- 
Binet gain scores as function of child's geographic region, his pre 
score on the Stanford-Binet, his age, and the extent of program input 
or classroom structure. Other analyses took into account factors such 
as the child's gender,^ socio-economic status of the family, length of 
pre test - post test interval, and different programmatic characteris- 
tics. This method is maximally sensitive to the effects of a particular 
v^iable when the effects of all the other influencing variables are 
taken into account. As a statistical approach, it is especially useful 
for singling out the relative effects of several interdependent 
^ confounded"; influencing variables. At the same time, however, it is 
quite insensitive to the interactions between such variables, 
especially interactions of small magnitude. 

In contrast, the SDC analyses were less cautious about inter- 
dependency among influencing variables but more sensitive to small 
interactions. Rather than take several factors into account 
simultaneously, SDC computed for each program variable, separate analyses 
for subgroups of children broken-out by age (younger or older) , pre- 
test IQ (high, middle or low), urbanicitv (urban or non-iirban) , and 
region (South or non-South) . Interactions were inferred if relation- 
ships between program variables and outcomes occurred for one' subgroup 
and not for the other, or if the direction of the relationship between 
program variables and outcomes was different for different subgroups . 10/ 

10/ SDC's procedure resulted in over 100 one-way analyses of variance 
computed separately for each influencing variable ^ It is probable 
on the basis of chance that the results of some of these analyses 
are spuripus. RTI's procedure, by comparison, consisted of a 
. series of four-way ajialyses of variance which examined gain scores 
as a function of several influencing vajriables simultaneously. It 
is less probable that the statistical interactions reported by RTI 
are spurious; but it is possible that some true interactions may 
have gone undetected in this model. 
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Neither of the two approaches Is more correct than the other * If 
the results are diffeirent, it is because slightly different analytical 
questions are being asked by the two models. The juxtaposition of the 
two approaches in this report should serve as a reminder that, 
computers nothwithstanding, data analysis remains much a matter of 
judgment. 
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FINDINGS 



The findings reported here are based on the 1968-I969 evaluation 
data; results from the 1967-1968 and- I966-1967 data are indicated only 
where parallel variables exist and meaningful comparisons with the 
1968-1969 data can be made. In general, the statistical analyses 
undertaken by System Development Corporation and Research Triangle 
Institute report similar findings and lead to similar conclusions. 
However, important differences between the SDG and RTI analyses do 
exist and are noted and explained when possible, for purposes of this 
summary report, only very reliable, statistically significant ( p < .01) 
findings are presented except when the psychological significance of a 
particular variable or relationship warrants attention to less reliable 
(i.e., p <; .05) findings. The findings are organized according to 
the particular substantive question they address, as determined by the 
overall design of these evaluation stiadies. 



SNTEHING CHARACTERISTICS AND PHESCORE PERFOHMAUCB 
What Were the Head Start Children and Families Like ? 

The children included in the 1968-I969 study ranged from two and 
one-half to six years of age at the time of initial testing. About 
bOl^ were preschool age (48 to 59 months) and about 22?S were kindergar- 
ten age (over 60 months). About one-third lived in the South (3^^) 
and over three-quarters lived in cities with populations over 50,000. 
Sf"^t,^/S^ S^°"PS were represented, but the largest proportions were 
Black (68^), White (l8?&), Mexican-American (6^). and Polynesian (/+%). 
The sample was evenly divided between boys and girls (515^ and J^9% 
respectively). Compared to the sample of children in 1967-1968, the 
children in the I968-1969 study were a little older, more urban, more 
Southern, and more Black. Compared to the population of all Head Start 
children, as described in Head Start census surveys, the sample under- 
represented older children and over-represented Black children. 

On the whole. Head Start children came from families with low 
incomes and little education. The median family income in 1968-1969 
was slightly above $4000 per year for the 6.? persons in each family 
(the corresponding figure for all Head Start families is $3210). 
Still the per capita income of the Head Start children was considerably 
below the national average (about $59? as opposed to $2,213). About 
A 2 ! parents (mothers and fathers) had not completed high school. 

About 60J of the mothers were unemployed; of the remainder, most were 

i °^ semi-skilled workers (13% and 19% respectively). Only 
11% of the fathers (living at home) were unemployed, the majority hold- 
ing unskilled or semi-skilled jobs (38% and ■&%) . Most.r..,vealing about 
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the socio-economic status of the Head Start families is the fact that 
61% of the families relied on parental earnings and were still 
eligible for Head Start — even two "breadwinners" did not keep families 
above the poverty live. Sub-employment , rather than unemployment, 
accounted for the poverty of most Head Start families. Over one-half 
of the children (55%) lived with both parents, but nearly one-third 
(29J^) lived with only their mothers. The levels of parental education 
and employment, and the proportion of children residing in two-parent 
families were essentially the same in 1967-1968, 

The parents of Head Start children began the program with over- 
whelmingly favorable attitudes toward it ~ 89% had the highest 
possible score on this scale. Their views about the value of education 
were also positive, but in the middle of the range of possible scores. 
On the other hand. Head Start parents did not reveal strong feelings of 
personal power over their environment or their children's schools nor 
a high degree of involvement in the community; overall they demonstra- 
ted moderately strong feelings of alienation. In 1967-1968 peirents' 
attitudes were equally positive toward Head Start (92% indicated they 
would send ainother child to the program) but slightly less positive 
toward education in general. 

When interviewed shortly after the 1968-1969 Head Staxt yeeir began, 
mothers reported a marked difference between their SLspirations and 
realistic expectations for their child's future. About half of the 
parents had high aspirations for their child's career and education 
(college degree), but only l^fo had correspondingly high expectations 
for actualizing their goals. In 1967-1968, by comparison, mothers' 
aspirations were slightly lower (80% aspired to some post-high school 
education for their child) and the discrepajicy between their aspirations 
and expectations was slightly greater (30?o expected to realize their 
goals). 

Summary. The 1968-1969 sample of Head Start children varied in 
age, sex, geographic location and ethnicity but was not completely 
representative of the total Head Start population: fewer older 
children and more Black children were included in the sample. Head 
Start children came from poor families that were below the national 
aversige in income and education and who suffered more from sub- 
employment than unemployment. Although the parents had favorable 
attitudes toward the program and education in general, they felt little 
personal power over their environment including the schools. Consistent 
with their feelings of alienation. Head Start mothers revealed a sub- 
stantial discrepancy between aspirations and expectations for their 
children's education. 

Were Differences Amonp; Children's Prescores Related to Background 
Characteristics of the Children ajid Families ? 

At the time of initial testing, children's scores varied on 
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cognitive and emotional outcome measures (described in the preceding 
chapter). The question then arose whether the differences were 
systematically related to any "background characteristics of the 
children or their families. Several statistically reliable and sub- 
stantial relationships between the child/family characteristids and 
prescores were indeed found. These relationships express only associa- 
tion between the entering characteristics and the prescores; they do not 
indicate causality. 

With the exception of the influence of prior Head Start experience, 
all of the relationships presented in this section are taken from the 
RTI analyses. Unless otherwise noted, comparable data from 1967-1968 
were not available in the two reports. 

At the time of initial testing, differences in children's pre- 
scores were systematically related to certain entering characteristics. 
These relationships are as follows: 

1. Younger c hildren performed better on cognitive tasks but 
appeared l ess we ll-adjusted socially . YnungPT- nMiHr-pn n^mpo,.^^ „ith 
older children had higher scores on measures of intelligence, ability 
to learn a new task, and achievement motivation. However, they also 
showed more behavior and motivation problems and greater feelings of 
xnadequacy.^ ^ 

2 • Children living with only their mothers showed more behavior 
problems . Children from single-parent families, compared with children 
irom two-parent families, showed a greater incidence of behavior 
problems in the testing situation, but they did not differ along other 
dimensions of social adjustment - i.e., feelings of inadequacy and 
motivational problems. 

I' Children with high aspiring mothers p erformed better on n ofrni - 
gZlJag^ The mother's level of aspiration f or her child's education 
was consistently and positively related to the child's prescores on 
measures of intelligence, school readiness, ability to LLn a new task 
and achievement motivation. On the Stanford-Binet IQ test, for eLmSe 
children with high aspiring mothers scored seven points highe? t^^ ' 
children with low aspiring mothers (89 vs 82 respectively). All of these 
^^oio i^! "^""^ positive and were found in both the I968-1969 and 
1967-1968 data. It should also be noted that similar patterns were found 
with respect to the mother's educational expectations for her cSS, 
again in both sets of data. ^ cnixa, 

11/ Younger children were not distributed evenly across all geographic 
regions. Thus the pattern described for younger children may well 
be the result of other factors associated with (and not separable 
from) younger children in this study, specifically, geographic 
region. 
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Jif. Children vdth optimistic parents performed better o n cognitive • 
tasks . Children of parents who were relatively less pessimistic about 
life scored higher on measures of intelligence! school readiness, and 
the ability to learn a new task. The difference on the Stanford-Binet 
was especially noteworthy: children of parents rated low in pessimism 
scored 8.^ points above children whose parents rated high in pessimism. 

5. Children who were frequently read to performed better on 
c ognitive tasks . Children who were read to frequently by adults and who 
Yisd relatively easy access to adults tended to have higher prescores on 
measures of school readiness. The difference was not statistically 
significant for the factor of accessibility but was both statistically 
significant and sutetantial for frequency of reading: children who were 
"seldom or never read to" obtained scores approximately eleven points 
below scores of children whose parents read to them "very frequently", 

6. Children whose parents seldom used physical punishment per - 
formed better on cognitive tasks . Children whose parents made little 
use of physical punishment obtsdned Stanford-Binet prescores 9«5 points 
above those obtained by children whose parents reported using severe 
physical punishment. Curiously, this relationship occurred only on 
measures of cognitive performance, and not social-emotional development. 

7. Children with prior Head Stgirt experience performed better on 
cognitive and social-emotional tasks . About 20% of the children studied 
had had prior experience in Head Start (including full year preschool or 
Head Start, summer Head Start, or less-than-full-year Head Start pro- 
grams). Previous Head Start experience was positively related to pre- 
scores on several cognitive variables (higher scores on measures of 
intelligence, school readiness, cognitive style , achievement motivation) 
and social-emotional vstriables (fewer motivation problems and less 
frequent feelings of inadequacy.) However, there was no difference 
between children who had had previous Head Ststrt experience and those 
who had not on either the incidence of behavior problems or social 
isolation. 

The effects of prior Head Start were only partially replicated in 
the 1967-1968 data: children with prior Head Start were more verbal 
and engaged in more social interactions than children without such 
experience, but they were also less well-adjusted in the testing situa- 
tion. No difference occurred on measures of general intelligence. The 
disparity between the 1968-1969 and 1967-1968 data is probably accounted 
for by the longer average exposure to Head Start among "experienced" 
children in 1968-1969. 

It should also be mentioned that some inconsistent relationships 
between children's ethnicity and prescores emerged but will not be 
stmnarized here since no meaningful interpretations can be made. These 
patterns were not stable across measures, did not replicate across years. 
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h!^;.**"' 19*7-1968 data, and were confounded with several other factors 
(region, age and socio-economic status). lactors 

closeir^S?;^ '■^^^ tackground characterisUcs were more 

auS^rac1e?i^^S?'\Sh:?" r^.rZVZrZ'r.Z' maie 

. V.^"^ parental aspirations and expectKions Sre hiRh tSe 
mother felt confident atout life, adults were accessiSe thP chiiH 
read to freauentiv ar,,^ u "'^t'^ accessioxe, the child was 

Lo irequentiy, and little physical punishment was used. 

in HeJ^i^r^^rf^^J children who had had some previous experience 

What Were the Head S tart Programs r.-l Irp9 

=:ir:HSc^ ^^^^^^^^ i.vssVSiV--r - 

^aiBlns in early cMldhcod eduo^tTn^M" Ae':? 5?)!"'>S!r 
their Head Start positions (52^ had none). ^P^cial training for 

^ jl33sroom orientation characterlst i n... . Teachers' <,p1-f 
"^^'^''mSh.."' J?^'" ^^^^^^^ ^^^^-^ eclec^ic^^ldfce^te^^r 

centered. ^"^^"^^^y child-orlented, and only weakly task- or parent- 
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TABLE k 



Level of Program Focus Weak Moderate Intense 

Child-centered 19?S 3^% ^7% 

Task-centered 7^ ^9% ^ 

Parent-oriented 96^ ^% 0% 
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Teachers also reported on their specific program goals and 
emphases J language (defined as provision of organized instruction 
sessions); child socialization (defined by cooperative play, trust 
toward adults, participation in small groups); independence and self- 
care (defined by speaklng-up for one's own rights and tending to one's 
own physical needs). Although classes varied greatly in program 
emphases, relatively few children attended classes having very strong, 
well-articulated program emphases as indicated by the relative percent- 
ages of children, as shown in Table 5. 

Trained observers' ratings of ongoing classroom activities largely 
substantiate the teachers' self-reports. The most frequently observed 
activities focused more on global aspects of child development than on 
specific behaviors or skills. These activities, from most to least 
frequent, included the followingi motivational development, self- 
esteem development, academic skill acquisition, concept development, 
group and social skills acquisition and last, language development. 

According to observers, classrooms also differed substantially in a 

dimension that was related to implementing special program emphases 

the availability and adequacy of various types of equipment (see Table 
6). 

As shown in Table 6, most children attended classes which had the 
equipment (large muscle to encourage gross motor activities such as 
play, but had less equipment appropriate for specialized activities in 
cognitive development or learning. 

Summary. Teachers' self-reports and trained observers' ratings 
were in agreement with respect to the general personal-social develop- 
ment orientation that characterized most Head Start classes. Structured 
cognitive development activities and language instruction were rarely 
obseirved, extensive equipment for these activities were rarely present, 
and strong, well articulated program emphases on cognitive or linguistic 
attachiaent were seldom reported. This picture is consistent with the 
view of Head Start as a broadly based child development prog:ram ; it does 
not reveal an intensive compensatory education program aimed at cognitive 
deficits. 

There is alKays the danger that even observational data, carefully 
obtained and reported, may fail to capture the essence of the classroom 
experience or the more subtle but critical variables that define a 
program orientation. Anecdotal records by the observers suggest that 
this may sometimes be the case. For example, compare these two reportss 

On the whole. Teacher X had a disciplined and quiet 
class. She had a rather formal view of education, 
a mini -grammar school atmosphere ... She emphasized 
children's ability to communicate, giving them tasks 
such as to describe to the class what they had done 
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TABLE 5 



Level of Program Emphasis Lower Middle Upper 

Language instruction 5S% 3^5? . 

Child socialization 49% 38% 139^ 

Independence and self -ceore 8^ 10% 2% 



41 



36 



TABLE 6 



Availability and Adeqiiacy of Equipment Low Kiddle High 



Cognitive materials 23J6 59j6 i8jg 

Large muscle 355 36^ i^j^ 

Sensory-motor 17^ 
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over the weekend . . . She seems to feel that her job 
as a teacher ends after she has taiaght the children 
their A's, B*s, and C*s in a group, and her job as a 
supervisor begins when they axe creating things or 
working on their own. 

The children in Miss F's Head- Start class seemed a 
lot happier thaxi children in other centers which I 
observed. With the exception of about two children, 
the teacher-child relationship was excellent. The 
childiren loved their teacher • . • The best thing 
about this class seems to be the mutual love between 
all the children. - All the children loved each other 
and they all interacted with each other. 



These anecdotes illustrate, among other things, the degree of 
variation among Head Start classes on several unmeasured interT)ersonal 
dimensions, and support the claim that Head Start is many programs, not 
a single program. There is certainly no evidence of a single educational 
"treatment" (that is given to all children) in the sense that Sesajne 
Street is a "treatment," The anecdotal data further point out that the 
nuances of teacher style and classroom atmosphere were barely glimpsed 
in either the observer forms or teacher self- ratings. Despite the 
subtle, and sometimes not so subtle, variation in teacher style, it is 
fair to say that most sampled Head Start classes in 1968-1969 were 
oriented in a nonspecific way to general personal -social development . 
It is the effectiveness of this whole-child approach that is reflected 
in the performance gains, and variations wi thin this approach which are 
associated with differential gains for some children. 



POST-TSST PERFORMANCE GAINS 



Did Children Gain From Their Head Start Experience ? 

The question addressed in this section asks whether Head Staxt 
children, taken as a whole, benefitted from their Head Start experience. 
The answer is yes, in many areas . Those areas in which children showed 
consistent and statistically reliable gains are summarized in Table 7; 
those areas in which children's participation in Head Start did not 
make a difference, i.e., bring about a change in performance, are 
summarized in Table 8. 

Areas showing significant change . Head Start children showed the 
greatest change or most substantial improvement in areas related to 
cognitive and social functioning and specific skills ~ achievement 
motivation, social adjustment, school readiness and academic achieve- 
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TABLE 7 



Pre-Post Changes in Performance for 1968-69 Sample 
Measures Showing Substantial Change* 



Measure 



Number Perfor- Distribu- Mean 
Children mance tion of Score 
Tested Score Children 



Direction of 
Change 



Pre Post Pre Post 



Stanford-Blnet {lk66) 

m 

(Cognitive 
Ability) 



Preschool 
Inventory - 
Staindaxd Score 
(School 
Readiness) 

Preschool 
Inventory - 
Sub-Tests 
(Academic 
Achievement) 



Animal House- 
Standaxd Score 
(Ability to 
Learn) 



(1232) 



(1233) 
(1231) 
(1229) 
(1228) 
(1183) 



Hertzig-Birch- (l272) 
Verbal Score 
(Cognitive Styl e ) 

Factors 

Affecting Test 
Performaaice 
(SociaJL Adjus tmen t ) 

Gumpgookles- (982) 
Standaird Score 
(Achievement 
Motivation) 



101+ 
90 to 
100 
89 or 
lower 

12i+ 
101 to 
120 

100 or 
lower 

personal 
social 
vocabu- 
laxy 
numeri- 
cal 

sensory 

12+ 
9-11 
8 or 
less 



22% 31% 
27% 30% 

51% 39% 



31% 5^ 
37% 3^% 

32% 12% 



11% 2hffo 
8^ 22^ 
k% 139S 
19^ 50^ 
13^ 19^ 

3% m> 

Gj% W 



high 3% 52% 

middle kl% 

low 1S% 7% 

very 7% SG% 
high 

high 66% 2J% 

moderate 27% 11% 



high 
a verage 
low 



6% 70% 
21% 19% 
73% 11% 
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89.5 9^.1 



109.9 120.3 



10.7 12.9 
5.3 7.3 

6.0 7.7 

11.6 lJ^.8 
8.5 9.2 

56.^ 61.2 



3^A ki,6 



+ 
+ 
+ 



TABIE 7 (Cont.) 



Measure 



Number Perfor- Distribu- Mean 
Children mance tion of Score 
Tested Score Children 



Direction 

of 
Change 



Parent 

Interview- 

(Attitudes 

toweird 

Education) 



Pre Post Pre Post 



(879) 



very 3^-% ^% 
favorable 
favorable 33^ 
moderately 
favorable 
or less Z% 



125.^ 1^2.1 



♦Taken from SDC finail report, 1972; Achievement Motivation (Gumpgookies) 
data from RTI final report, 1972. 



TABLE 8 

Pre-Post Changes in Performance for I968-69 Sample 
Measures Showing Little Change* 



Measure 



Nmber 



Perfor- Distribution Mean Score 



Children 


mance 


of 








Tested 


Score 


Children 










Pre 


Post 


Pre 


Post 


(1272) 


96 to 100 












91 to 95 


3^ 


36% 








90 and 


^0 


ko 








below 






90.3 


90.7 


(1272) 


high 












moderate 




k2% 








XU n 




31% 


11X« 0 


111 • 7 


(1275) 


hais many 


\J\J/0 


90s; 








friends 












nas some 


20% 


21% 


9^.1 


93.8 




friends 








(876) 


100^ 












favorable 


89^ 


92% 


194.8 


195.7 


(887) 


high 


28^ 


27% 








moderate 




Wo 








low 




2Wo 


123.6 


122.8 


(887) 


moderate 


1?% 


Wo 








or higher 












low 


Wo 


3^0 








very low 


Wo 




9.4 


9.3 


(881) 


19 or 




35% ■ 








higher 












11-18 


62% 


58^ 








(moderate ) 










0-10 


6% 


7% 


16.6 


16.8 




(low) 











Hertzig-Birch- 
Work Score 
(Cognitive Style) 



Hertzig-Birch- 
Spontaneous 
Responses Score 
(Cognitive Style) 



Isolate Score 
(Sociability) 



Paxent Interview 
(Attitude toward 
Head Start) 

Paxent Interview 
(Feelings of 
Personal Power) 

Parent Interview 

(Community 

Involvement) 



Parent Interview 
(Paurent Alienation- 
high is more 
alienated) 



*Taken from SDC final report, 1972, 
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ment. Statistically significant, but less substantial gains were noted 
in areas related more to basic cognitive abilities — intelligence or 
generaJL cognitive ability and learning ability. 



The largest gains were made in achievement motivation and social 
adjustment. At the time of initial testing, only T% and 6%, respec- 
tively, of the children performed at ''favorable'' levels; in Spring of 
1969, 70?o and 66?o, respectively, performed very favorably on these 
measures. The performance increment or gain for children receiving 
very favorable scores was a substantial G^/o and 6l?o. 

The next largest gains were reported for skills associated with 
scholastic readiness: a Yi% gain for children obtaining high scores 
on verbal skills and a Zy/o gain for children obtaining high scores in 
school readiness At the time of entry into Head Stairt, only J\% of 
the children demonstrated skills usually predictive of successful 
school perf ordnance ; in Spring, 1969 1 ^% demonstrated such skills. 

By contrcist, gains in basic cognitive and learning abilities, 
although statistically reliable, were smaller in absolute magnitude. 
The performance increment, again in terms of gain in the percent of 
children achieving very favorable scores at Spring re-testing, was only 
9^ on the Binet IQ test and G% on the test of ability to learn. It is 
worthwhile to mention that the absolute LQ gain of 5«6 points, and the 
final mean IQ level of 9^.1i are similar to those reported in other 
studies of the effects of traditional preschool experiences on dis- 
advantaged children (Bissell, 1971; Stanford Reseeirch Institute, 1971; 
Steams, 197l)» And, existing evidence supports the axgument that gains 
of this magnitude axe best accounted for by motivational rather than 
cognitive changes associated with these experiences (Zigler and 
Butterfield, 1968). Although the final level of performance reported by 
this national evaluation is still below the national average (about 
6 points below) assessments of individual Head Start prograjns have 
reported larger gains and final levels closer to the national average 
(Ryan, 197^). 

It seems likely that the changes on cognitive and social measures 
presented here are "psychologically meaningful" in the sense of 
reflecting real phenomena as distinct from testing and regressing 
artifacts. To illustrate this point, compare the very laxge gains in 
initially high levels of achievement motivation (from 6^ to 70^) with 
the less substantial gains in initially high scores on tests of school 
readiness (from 13^ to 54^) or tests of the ability to learn (from Yf/o 
to 19^) . Although ample room for improvement was available on these 
measures, the magnitude of gains varied considerably from one to another 
suggesting that the gains do not simply reflect a tendency towatrd 
greater improvement where greater room for improvement exists. 

Areas showing little or no chanp:e . Table 8 summarizes the results 
for those measures which did not change during the Head Start program. 
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Many of these no-change measures tapped pairent attitudes. It appears 
that while parents' attitudes toward education improved from "favorable*' 
to **very favorable'', parents' attitudes vis-a-vis society did not chajige; 
no significant improvements were found in parents* feelings of personal 
power (predominantly externally and not internally controlled), of 
reported community involvement (predominantly low or very low), or of 
alienation (predominantly moderate or high). Thus, it does not appear 
that Head Start affected how parents felt about their own chances or 
place in society, although there was ample room for improvement on all 
measures of these attitudes. These findings should not be interpreted 
to mean that Head Start has no effect on parents' lives; many SLnecdotal 
reports of parents tell of dramatic changes from fearful, isolated 
people to dynamic community leaders and competent self-assured indivi- 
duals. On the other hand, the anecdotal reports may be somewhat atypical. 
Another caveat to bear in mind is that the durability of changes in 
parent attitudes after termination of the Head Start experience is 
unknown at this time. 

Also depicted in Table 8 are seversuL measures of children which 
did not show reliable change. Two of these were virtually already at 
"ceiling level" (i.e., did not leave much room for improvement); 
measures of the child's willingness to work on a test such a^ the Binet 
and peer sociability (non-social isolation), suggesting that these are 
not among entry-level problems of Head S.tart children. 

Summary . Overall, then, children began Head Start low in achieve- 
ment, in scholastic readTness smd general learning ability. At the 
end of six to eight months, in Spring 1969, they were highly motivated, 
very well-adjusted to the school situation, had acquired satisfactory 
levels of school-readiness skills, and showed modest gains in general 
learning; ability. Their parents, moreover, had very favorable attitudes 
toward education (20% increment in parents with very favorable attitudes 
toward education) , 

Do these gains reflect the Head Start program emphasis on personal- 
socia.1 development reported by teachers and observers? Or do they 
indicate that basic cognitive abilities axe much more difficult to 
influence than motivation? The clearest way to resolve these questions 
would require carefully controlled experimental studies in which initial 
levels of entry characteristics were varied systematically with contrast- 
ing program emphases. Then, if a particular program emphasis produced a 
particular pattern of gains — e.g., a personal-social program emphasis 
produced the largest personal-social gains, but smaller cognitive gains, 
and d- contrasting progrstm emphasis produced an opposite pattern of gains 
it could be reasoned that progrsan emphasis accounts for differences in 
the magnitude and type of gains made. On the other hand, if the Sctme 
pattern of gains resulted from contrasting program emphases — e.g,, for 
all programs the largest gains would be reported for motivation; next 
largest for specific skills; smallest for basic cognitive abilities — it 
could be reasoned that program emphasis is irrelevant to the type, if 
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not magnitude, of gains made. It is unlikely that ctny experimental 
study would yield such cleatr cut findings, and permit drawing such 
unambiguous conclusions ateut a complex issue of this sort. It is even 
less likely that the loosely controlled national evaluation studies 
summarized in this report will resolve completely the issue of whether 
-••currioJlum" or program type makes a difference to the gains obtained in 
early intervention programs. However, the results of these evaluation 
studies should shed considerable light on this issue. 

Did Some Children Gain More Than Others? 

Overall, the answer is ^es. On the Stanford-Binet IQ test, for 
example, changes ranged from losses of more than seven points for 9.5^ 
of the children to sizeable gains of more thctn 18 points for 9.2?g of the 
children. The remainder of the data analyses were addressed to identify- 
ing the factors associated with these differences. Three kinds of 
influencing factors were examined and will be discussed in this reports 

"-background characteristics of the children and their families 
— chaxacteristics of the teachers and programs 
— interactions or unique combinations of child/family and 
teacher/ program chaxacteristics. 

Statistically significant and reliable (p C .01) changes associated 
with these factors will be presented and discussed. The main data base 
will continue to be the 1968-69 national evaluation study and where 
appropriate and possible, reliable relationships from the 1967-68 and 
1966-67 data will be mentioned. 



Which Child and Family Characteristics Are Associated with the Greatest 
Gains In Performance? 

The pu2T)0se of this section is to explore the issue of whether 
different types of children make different kinds of gains in the Head 
Start program. If children do vary on a particular characteristic, 
for example age, vary systematically in their performance gcins (i.e., 
gains vary in relation to differences in age), it can be reasoned that 
age of Head Start children makes a difference to the gains they will 
make. On the other hand, if the differences in children's age are not 
systematically related to performance gains, an argument can be made 
that age differences may not be an important determinant of the gains 
made in the Head Staxt program. 

The characteristics of children and families measured in these 
studies are listed below with the relationships that were found between 
these chaxacteristics and performance gains. 

.Age. Younger children (under 60 months of age) gained more in 
the area of social adjustment; older children gained more in specific 
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skill axeas (e.g., numerical). Younger children also luade greater gains 
on the Stanford-Bine t test of general cognitive ability in both 1966-6? 
and 1967-68 data, but curiously, this pattern was not replicated in the 
1968-69 data. 

.Sex. Whether the child was male or female was unrelated to 
performance gains on all measures. Not only were sex differences un- 
related to gains, but recall as well the lack of significant relation- 
ships with initial performance levels. These findings fall to support 
the assertion that the all-female teaching staff is associated with the 
unfavorable program experience often reported for boys. Several interes- 
ting and unanswered questions are suggested by this intriguing no- 
difference finding and perhaps warrctnt further probing in subsequent 
statistical analyses. For instance, did the male volunteers (whose 
performance was not analyzed separately) contribute to producing equal 
benefits for boys and girls? Were teacher aides, many of whom come from 
the community, responsible for the equality of boys' and girls' experien- 
ces? 

12/ 

. Ethnicity . — ' The relationships between ethnicity and gains were, 
on the whole, different for different measures, presenting no consistent 
picture or pattern. For example, on the Stanf ord-Binet, Black children 
made the greatest gadns and the Mexican-American children the smallest; 
however, no significant differences between ethnic groups on Stanford- 
Binet gains were reported in 1967-68. And, on measures of school 
readiness, Polynesian children made the greatest gadns. These findings 
should be interpreted with caution, however, since the Mexican-American 
and Polynesian samples were small (l39 and 83 children respectively) 
and the data are confounded -- Mexican-American and Polynesian children 
differed from their Black and White Head Start counterparts in age as 
well as ethnicity. 

13/ 

.Initial IQ.-^ Pretest scores on the Stanf ord-Binet were related 
to gains on several measures. In both I968-69 and I967-68, children 
with initially low IQ scores (less than 85) showed the greatest gains 
on the Stanf ord-Binet. It is important to note that this relationship 
was linear, with medium entry IQ*s (85"95) associated with moderate 
gains and high entry IQ's (over 95) with small gains. The same linear 
relationship was also found foi gains on measures of school readiness 
(1968-69). 



12/ The relationships between ethnicity and gains were analyzed only 
by Research Triangle Institute. 

Findings concerning the relatioships between initial IQ and gain 
scores are based on analyses performed by System Development 
Corporation. 
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The strong relationship between initial IQ and IQ gain scores can 
be partially accounted for by the measurement artifact of "regression- 
toward-the-mean" : extremely low scores may be spurious due to fatigue, 
distraction, etc. and, on subsequent re-testing, individuals who scored 
extremely low are likely to score higher, thus raising the mean of the 
group. However, as concluded in the final report of these evaluations 
(SDG, 1972), the differential gsdns made by the low IQ group seem to 
reflect more than regression artifacts because: (a) the low initial IQ 
group had the lowest scores on all pretests except social isolation, 
indicating that placement of a child in this group was not due to 
random error of measurement on one pretest measure alone, and (b) the 
low initial IQ group gained most on nearly all performance measures. 
The strength of this argument is diminished by the fact that the 196?- 
1968 data do not show this extensive and consistent pattern. The 
influence of initial IQ is best interpreted, therefore, partly as a 
resxilt of statistical regression and paxtly as a result of genuine 
differential gain. 

. Prior Head Start Experience . Children with no previous Head 
Start experience and children with lengthy previous Head Start experience 
showed greater gsdns in preschool academic achievement and ability to 
learn a new task than did children with short previous exposure to Head 
Start. This finding is difficult if not impossible to interpret; it may 
reflect the difference between summer and full-year Head Staxt, or it 
may be a result of factors confounded with previous Head Start atten- 
dance • 

. Urbanicity . On a variety of measures, especially of cognitive 
functioning, children from non-urban communities showed greater gains 
than urban children. In I967-68, these differential gains occurred only 
in the social domain. However, non-urban children also differed from 
urban children in their initial scores on these measures. Thus, although 
these results are tantalizing, no unajnbiguous conclusions can be drawn 
about the effects of urbanicity, since these data axe confounded. 

. Region . The relationships between geographic region and gains 
were Inconsistent between 1967-68 and 1968-69, probably reflecting 
experimentally Induced changes in the Head Start program in the South 
(1968-69). In 1967f non-Southern children showed larger gains in both 
cognitive and social-emotional development than did their Southern 
counterparts (the difference in cognitive gains was only marginally 
significant). However, in I968, after many Southern Head Start centers 
were converted to structured academic programs. Southern children showed 
greater gains than non-Southern children on measures of verbal respon- 
siveness, specific academic skills (numerical), and cognitive function- 
ing (again marginal in the cognitive area). It seems likely that region 
per se is not an important determinant of gains, but that the program 
variables (e.g., academic orientation) and child characteristics (e.g., 
age) that are confounded with geographic region in this study may indeed 
be related to gains in a psychologically meaningful way. 
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few significant relationships between socio-economic status and gains 
were reported. Nonetheless, there was a consistent ("but not 
statistically significant) tendency for the most deprived" children 
to show the greatest gains. A single isolated exception was the 
significant positive relationship between the mother's educational 
level and the child's ability to learn a new task. This finding was not 
replicated in 1967-68, however, suggesting that within the Head Start 
population, relative socio-economic status is not a determinant of gains. 

. Other Family Variables . Unrelated to performance changes made 
during the Head Start program were the host of family background 
characteristics, which, by comparison, were strongly related to initial 
performan ce levels . These characteristics included: family structure, 
parental employment, family mobility, maternal aspirations and expecta- 
tions for the child's education, parent pessimism, accessibility of 
adults in the home, frequency of reading to the child, .parent mode of 
physical control. 

Summary. Although some child and family background characteristics 
are meaningfully related to performance gains, many more appear unrelated 
°K ^^'^l iriconsistently related to gains. Child's sex and most family 
Characteristics including socio-economic status were unrelated to gains; 
child's previous Head Start experience, ethnicity, urban! city and 
geographic region were only inconsistently related to gains and were 
confounded with other variables such as program orientation, child's 
age and prescores. The strongest and most consistent relationships 
emerged between child's age and pretest IQ and gains on measures of 
social and cognitive development. However, the very strong relation- 
ships between pre-test IQ and gains are tempered by testing artifacts. 
Two important conclusions can be drawn from these findings; first, that 
the family characteristics measured in these studies are not important 
determinants of the gains the child makes during the Head Start program 
although they are very critical determinants of the child's perfoimance 
level at the time of entry into Head Start; and second, that, with the 
exception of age and initial IQ, child background characteristics as 
measured in these studies also do not appear to be important determinants 
of gains made during the Head Start program. The relationship of other 
kinds- of characteristics (teacher/program) and combinations of charac- 
teristics (child/family and teacher/program) to gains scores will be 
explored in subsequent sections. 

Which Teacher and Pro gram Characteristics Are Associated with the 
Greatest Gains in Performance? ' 

Thiis question will be approached in two ways: first, in terms of 
specific relationships between various program and teacher characteris- 
tics and children's gains; and second, in terms of the program inputs 
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associated with more broadly defined child outcome areas. The pxirpose 
of orgaLnizing the findings this way is to ensure that children's gains 
are viewed from the two perspectives of program inputs and child 
development outcomes. 

How teacher/program characteristics are related to gains . In this 
section the following categories of teacher and program characteristics 
will be discussed: 

— teachers 'behaviors in the classroom 

— cleissroom activities 

— cleissroom materials 

— aspects of classroom atmosphere, 

1, Teacher's Background 

Teacher's background characteristics were assessed by means of a 
structured teacher interview. The most significant background charac- 
teristics in terms of implications for early childhood education, were 
the teacher's education and prior experience with disadvantaged children. 

Teacher's age . Children whose teachers were older (over 28 years 
of age) made greater cognitive gains than did children in classes with 
younger teachers. 

Teacher's education and experience . Children whose teacher's 
level of general education and extent of her prior experience with 
disadvantaged children were relatively high performed less well on 
measures of preschool achievement and socieil adjustment than children 
having teachers who appeared to be less well qualified. This puzzling 
finding requires some further explanation and in so doing, will draw 
upon additional data. 

In 1967-68, teacher's education and experience were also negative- 
ly related to child outcomes, but in the domain of social interaction 
with peers. Separate ajialyses for Southern and non-Southern children 
revealed, however, that the effects of teacher's experience (but not 
education) were regional. In other words, the negative relationship 
between teacher's prior experience and cognitive perf'ormance held only 
for non-Southern children, suggesting possible confounding with other 
factors. Moreover, there is some indication (from the RTI report, 
1968-69) that specific training in early education was actually positively 
related to gains in school readiness and learning ability, but these 
effects were, unfortunately, weak and small. 

Certification requirements for early childhood teachers have 
been among the most controversial policy areas. The negative relation- 
ship between the years of general education and benefits to children 
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seems to support the view that paraprofessionals can relate better to 
Head Start youngsters. However, proponents of this view should also 
take into account the pervasive lack of prior paid experience with 
disadvantaged children among Head Start teachers during 1966-69. It 
seems likely that not the years of general education per se, but the 
lack of relevant prior experience coupled with it, may account for this 
curious negative relationship between teacher's qualifications and 
children's benefits. The data are most consistent with the position 
held by many early childhood educators that education and experience 
in early childhood education rather than general education alone, are 
essential preparation for preschool teachers. However, too few 
teachers in 1966-69 had degrees or training in early childhood education 
for the analyses to be more than suggestive. Future analyses of the 
1971-72 Planned Variation data and evaluative research on the new 
"credentialling-by-accomplishment" Child Development Associate Program 
will be valuable in resolving the question of what training is associated 
with teacher competence and child benefits. 



2. Teacher's Approach and Program Emphasis 

Differences among teachers' approaches and program emphases, as 
reported by the teachers themselves, were related to a variety of 
outcomes. The relationships are summarized below. 

Method of control. Children having teachers who used non-physical 
control gained more on measures of cognitive abilities and school 
readiness than children with teachers who used physical control to 
maintain discipline and order in the classroom. 

Teacher's style - co pnltive aspects . Children whose teachers 
stressed providing the children with quality cognitive inputs were not 
differentially affected; however, the parents of these children showed 
an increase in their feelings of alienation; 

Emphasis on independ ence and self -care . Children whose teachers 
reported high emphasis on goals of independent and self-care showed 
greater gains in cognitive abilities, school readiness, and social 
adjustment than children whose teachers reported a lower emphasis on 
these goals. 

Emphasis on socialization. Children whose teachex^ emphasized 
child socialization goals showed better social adjustment in the 
testing situation than children whose teachers did not emphasize these 
goaxs • 

Emphasis on lan^uafie. Children whose teachers emphasized language 
development gained less on measures of cognitive abilities and school 
readiness than children whose teachers did not emphasize language 
development. This finding is problematic unless viewed in the context 
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of the classroom olDservation data. Actual observations revealed that 
high use of laingua^e materials was associated vriLth greater cognitive 
gains, as would be expected. It appears there is some discrepancy 
between teacher' s self reported program., emphasis on language and actual 
observations of ongoing program activities. 

3- Teacher's Behavior in the Classroom 

Several characteristids of the teacher's behavior in the classroom 
were differentially associated with children's gains. These character- 
istics axe described in detail in Appendix A as the teacher factors 
derived from the 1968-69 data collected with the observational instru- 
ment (OSCI - Observation of Substantive Curricular Input). Teachers' 
behavior was, on the whole, significantly related to gains in many areas, 
as indicated by the relationships summsaized below. 

Social-emotional interactions with children . Children whose 
teachers engaged in more social -emotional interactions did not perform 
as well on measures of the ability to learn a new task ets did children 
whose teachers exhibited less of this behavior. 

Structured lessons . Children whose teachers structured lessons to 
a greater extent, showed greater gains on measures of learning ability 
and school readiness. 

Art activities . Children whose teachers provided more art-related 
activities showed greater gains in school readiness. 

Creative small-CToup instruction . Children whose teachers used 
middle and high levels of creative small -group' instruction activities 
(e.g,, visual motor activities) showed greater gains in cognitive 
ability, school readiness and the ability to leeim a new task than 
children whose teachers provided only low levels of such activities. 

Rote learning . Children whose teachers placed a relatively low 
emphasis on rote (receptive) learning showed greater gains on measures 
of learning ability. 



Classroom Activities 

Characteristics describing classroom activities were also derived 
from the OSCI observational instrument. Compared with teachers* 
behavior, however, classroom activities (see Appendix A, 1968-69, OSCI 
class factors) appear to be less closely related to children's gains. 
In fajt, several factors (the presence of group activities in a social 
context; social-emotional interactions across contexts; and small group 
ci:v?Ative activities) were altogether unrelated to gains on any outcome 
measure. Classroom activities that were differentially associated with 
gains are summarized orieily below. 
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structu red lessons . Children in relatively ''structured" classes 
(e.g., programmed materials and control by head teacher) showed greater 
gains in school readiness and learning ability. 

Verbal c ommunication . Children in classes characterized by fairly 
extensive verbal communication across contexts gained less on measures 
of learning ability and achievement motivation. 

Language and discrimination learning . Children in classes charac- 
terized by frequent occurrence of language and visual discrimination 
activities and use of learning materials gained more in general cogni- 
tive ability but less in learning ability. 

Parent participation. Children in classes where parents participa- 
ted in the program (e.g. , as defined by the number of visits to the 
Head Start center) showed greater gains in the area of social-emotional 
adjustment. The relationship was weak, however, and represents only a 
very token measure of parent involvement. 



5. Classroom Equipment and Materials 

The physical attributes of the classroom were yet another aspect of 
Head Start programs measured in the 1968-69 and 1967-68 studies. 
Classroom equipment and materials were related to gains, but the data 
suggest two possible interpretations! first, that classroom equipment 
and materials are incidental indices of the physical quality of the 
program; or second^ that the equipment and materials themselves are 
psychologically related to the gains reported. The first interpretation 
is most fitting in the case of large muscle equipment, descHbed below. 

Large muscle equipment. Children in classrooms with access to 
'^tfff4."^^^"^® equipme^rt(eTg. , jungle-gyms) gained more in cognitive 
ability and school readiness than children in classes where such 
equipment was less accessible. However, observed high frequency use of 
this equipment was associated with smaller gains in cognitive ability, 
fewer social interactions among children, and less favorable parent 
attitudes about the perceived benefits of Head Start. While the presence 
of this equipment is associated with effective programs, excessive use 
of the equipment may divert the children from activities more likely to 
enhance their cognitive development. 

Small muscle equipment. Children in classrooms where small muscle 
equipment was available and extensively used showed increases in the 
frequency of social interactions initiated with peers (1967-68 data). 

Cognitive materials. Children in classrooms where cognitive 
materials were available and extensively used, showed greater improvement 
in their task orientation in the testing situation. 
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Dramatic materials . In classrooms where dramatic materials were 
used extensively (and considerable verbal Interaction probably occurred) 
children showed comparatively greater Increases In social Interactions 
and parents revealed more favorable perceptions of the benefits of 
Head Start. 



6. Classroom Atmosphe 

Classroom atmosphere Includes characteristics which reflect more 
qualitative and experiential aspects of the classroom environment, and 
which have little to do with programmatic Inputs. Nonetheless, such 
characteristics are related to differences In gains. 

Student and teacher turnover rates . Children In classes where the 
student turnover rate wa^ relatively low shovrcd greater gains In cogni- 
tive ability, social interactions with peers, and social adjustment 
than did children in classes with high transiency. On the other hand, 
children who had experienced a high degree of teacher continuity also 
showed greater improvement in their social: adjustment . This pattern 
suggests that children's social acLjustraent iri stressful situations 
(specifically, testing) is facilitated by a changing peer context, but 
within a relatively stable teacher-child context . 

Length of exposure to Head Start , While there were no substantial 
relationships between pre-post interval length and gains, longer inter- 
vals (defined as 2? or more weeks) were associated with greater cogni- 
tive gains and improvements in social adjustment and behavior problems. 
These findings suggest that program effects are cumulative, at least 
within the time span of the full -year Head Start program. 



Which Progx;am Inputs Are Associated with Specific Outcomes ? 

The numerous and often complex relationships between teacher/ 
program characteristics and child outcomes presented in the previous 
section of this report will be restated below in more general terms by 
clustering the program inputs -around areas of groupings of child develop- 
ment outcomes. 



Data pertaining to student and teacher turnover rates are taken 
from 1967-68 and 1966-6? final reports, System Development 
Corporation; data pertaining to length of exposure to Head Start are 
taken from the 1967-68 and 1968-69 final reports. Research 
Triangle Institute. 
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1, Preschool Achievement 

In general, children gained most in classes where; 
.the teacher was older 

.the teacher had less formal education and experience 
•the teacher did not use physical control 

• there was high or moderate emphasis- on independence and self- 
care 

• there was high emphasis on structured lessons 
•there was moderate emphasis on small group activities 
.there was ample large muscle equipment available but not 

excessively used 

•there was high emphasis on art activities 

2. General Cognitive Ability 

In general, children gained most in classes where s 
.the teacher was older 

•there was moderate emphasis on small group activities 
•there was low emphasis on rote learning 
•the teacher did not use physical control 
•there was high emphasis on independence and self -care 
•there was ample large muscle equipment which was not 
excessively used 

•there was moderate use of cognitive materials 
.despite the teacher's self -reported low emphasis on language, 
language materials and activities were frequently observed 

The value of moderate and high levels of cognitive structure and 
activities for cognitive gains was replicated in the 1967-68 data as 
well^ 



3. Learning Ability 

In general, children gained most in classes where i 
•the teacher was older 

•there was a low emphasis on teacher's social-emotional inter- 
action 

• there was high emphasis on structtured lessons 

• there was low emphasis on rote learning 

• there was low emphasis on verbal communication 
•there was moderate emphasis on small group activities 

•there was low emphasis on language and discrimination learning 
tasks 
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Social Adjustment 



Social adjustment, as an area of outcomes, encompasses adjustment 
to the testing situation, reduction in the number of behavior problems, 
and improvement in motivation and self-esteem. In general, children 
gained most in classes where: 

• the teachei' had little formal education or previous experience 
•there was high emphasis on independence and self-care 
•there was high emphasis on child socialization 
•there was a high level of teacher continuity and a low 
pupil/teacher ratio 

The clusters described above show how teacher and program inputs, 
including teacher's educational background, approach and classroom 
behavior, were related to children's gains in all outcome areas^ 
Classroom activities suid materials, independent of the teacher's input, 
were on the whole less extensively related to child outcomes, particularly 
in the area of social adjustment. Cognitive gains were influenced more 
by cognitive-oriented progr^im emphases and classroom activities; social 
gains were influenced more ly social-oriented program emphases and by 
qualitative aspects of classroom atmosphere • 



Do Different Teacher and Program Approaches Influence Different Children 
in Different Ways? 

Experts in the field of early childhood education disagree on the 
definition of a good preschool progi-am^ According to Weikart (1971), 
and! Weber (l97l), curriculum is less critical to child outcomes than 
the non-substantive characteristics of the program such as administra- 
tion, planning, and supervision^ Some (Bissell, 1970; Kohlberg and 
Mayer, 1971) have identified global substantive program attributes 
(e^g^, program structure; program emphasis on facilitating stages of 
cognitive development) that seem effective in a broad sense, in facilita- 
ting child developments Still others (e^g*. Miller, 1971) have found 
that different programs produce different effects on different children 
~ interactions — - a finding that supports the position held by most 
early childhood educators that programs should be "matched" to child 
characteristics . 

Analyses of the Head Start national evaluation data do reveal 
interactions between program and child characteristics •15/ A program or 
teaching approach associated with superior performance for one group of 
children may not necessarily be eissociated with superior performance for 
another group; and in some instances, program inputs that are positively 
related to gains for one group may be negatively related to gains on 

137 The interactions summarized in this section of the paper were 
reported by SDC only, SDC analyzed the data using an approach 
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other measures or gains for another group of children. To restate the 
point, interactions are oftentimes complex inter-relationships between 
two kinds of characteristics, in this case program and child characteris 
tics, aind outcomes. 

Teacher /progra m approaches associated with gains for different 
groups of children. In this section interactions between program and 
child characteristics will be presented to highlight the particular 
program approaches or teacher characteristics which were found to have 
positive effects on certain subgroups of children within the Head Start 
population tested, 

.Teacher's use of non-punitive methods of control or restraint 
from physical punishment, was most consistently related to 
gains in cognitive abilities and preschool achievement for; 

—children with initially high IQ's (over 95) 
—relatively younger children (under 5 years of age) 
— urban children 



.Moderate to high levels of program emphasis on goals of indepen- 
dengg_and self -care were positively associated with gains'Tn 
cogniUve abilities, achievement and social adjustment for: 

—children with initially high IQ's 
—younger children 
—urban children 



.Moderate levels of program emphasis on cMld socialization ^n.i. 
were associated with gains in achieveme nt for; ^ 



— cn:idren vfith initially high IQ's 
—younger children 



ly (cont.) especially sensitive to detect i no- in+o.n^^4.^ 

by RTI is consistent ^?ih f T^"" programs and gains reported 

y consistent with the interactions suimnarized in this report. 
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.High levels of pJ^^gram emphasis on lan^uag_e development were 
found to have ini^^d and even reversed, effects; a strong 
program emphasis ^as positively associated with gains in 
achievement and cognitive abilities for children with initially 
low and mid-level Ift's^ but negatively associated with cognitive 
gains for childar^^^ with initially high IQ's. 

For younger chil<i^en, the strong language development emphasis 
negatively rel^t^^ to gains in both cognitive abilities and 
cognitive styles* Butj conversely i a strong langioage develop- 
ment emphasis va^ ^soci^ted Hith achievement gains and cogni- 
tive style gains ^Or older children. 

• The quality of j t^g:Sher ^s_cp^;piitive input also displayed reversal 
effects. Modei^a^^ quality cognitive input, as reported by 
teachers, was ^^Ociated with the greatest gains in cognitive 
abilities for ch^-l^en with high pretest IQ's. However, high 
quality cognitive input was also associated with the intensifi- 
cation of feeling of alienation among parents whose children 
were more disadv^tagedj those with initially low IQ's, and 
those who were o^^^t at the beginning of the Head Start program. 

.Extensive use of d^^mati^jnate^^ (in 1967-68), which are 
distinguished by 3- large verbal component, wsls related to gains 
on indices of ov^^^ll verbal behavior fori 

—children with i^^tially low and mid-level IQ's 
—both younger af^* oldet children 
— urban children 

Interestingly, thi^ progtain activity was also associated with 
positive changes in measures of parents' perceived effects of 
Head Start among ja;^ents whose: 

—children had Irji^'^ially high IQ's 
— children were yOUnget 
— children were non^urb^ 

•The avadlability adeguacy o f laxgrg muscle eq ui pment was 
related positive!/ to gains in cognitive abilities for children 
with initially lo<^ and mid-level iQ»s; to achievement gains for 
older children; a^^ to improvements in cognitive style for 
non-urban childi^e^' 

However, the aMu^j^Jig^ of JLargg ^ muscle equi-pment was negatively 
associated with g^ihs in cognitive abilities for these same 
groups of childz^efl ^nd negatively related to parents" perceived 
effects of Head for all subgroups of children. As in an 

earlier section of this paper which summarized overall program 
effects, these tw^? Superficially similar variables display 
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opposite effects, particiaarly on the more disadvantaged Head 
Start children. Recall that availability can be construed 
merely as an index of program quality whereas actual use of 
large muscle equipment indicates the nature of children's 
experience in the program. 

degree of teacher continuity (in 1967-68) was associated 
with improvements in socieO. adjustment in the testing situation 
fori 



—children with initieuLly low IQ's 
— younger children 
— urban children 



.And, in 1967-68, a smaller pupil/teacher ratio was related to 
gains in both social adjustment to testing conditions and 
sociability among peers fori 

—children with both initially high and low IQ's 
— younger children 
— urban children 

Ratios ranged from a low of 3:1 to a high of iBil with over 
two-thirds of the children in classes .having ratios between 
5il and 9il. 

In these same groups of children, class transiency - or pupil 
turnover rate, was negatively related to gains in both 
cognitive abilities and sociability among peers suggesting that 
for some sul^roups of children a sta.ble peer environment appears 
to be more facilitative. 

.Negative relationships occurred between the level of the teacher's 
general educational preparation and gains in preschool achieve- 
ment and social adjustment (in both 1967-68 and 1968-69 data) 



—initially low IQ children 
— older children 
— urban children 



.Although teacher's paid experience with disadvantaged children 
was unrelated to gains in cognitive abilities, it was negatively 
related to gains in achievement, social adjustment and sociability 
among peers fori 

—children with initially low and mid level IQ's 
— younger children 
—urban children 
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It is noteworthy that parental feelings of power were 
negatively related to this variable and in all subgroups of 
children analyzed. As discussed earlier, the results associated 
with these variables axe problematic , paxtially because the 
variables axe not measures of specific eaxly childhood education 
or specific teacher competencies. 

What Kinds of Children Benefited Most from Different Teacher and 
Progrgtm Approaches? 

In this section program-child interactions will be presented to 
highlight the particular groups of children for whom certain clusters 
of program and teacher characteristics are most effective. Consistent 
with all preceeding analyses the subgroups of children axe differentiated 
according to initially high or low IQ, younger or older within the Head 
Start population, and urban or non-urban location, 

.For children with initially low IQ's (below 85) cognitive and 
social gains were greatest in programs where 1 

— child socialization goals and language development activities 

were strongly emphasized 
— the teacher used non-physical means of control 
-laxge muscle equipment was avcdlable and adequate but not 
excessively used 
— classroom atmosphere reflected teacher continuity, a low 

pupil/teacher ratio and low pupil turnover rate 
— teachers had less formal general education 

•For children with initially bl^h IQ's (over 95) cognitive and 
social gains were greatest in programs where: 

— the teacher reported a moderate quality of cognitive input to 
children 

— the teacher did not rely on physical means of control or 
discipline 

— a high emphasis was placed on goals of independence and 
self-caxe, and a moderate emphasis was placed on socializa- 
tion goals 

— the classroom atmosphere was characterized by a small pupil/ 
teacher ratio and low pupil turnover rate 

•For younger children (under five years of age) cognitive cUid 
social gains were greatest in prograuns where s 

— the teacher relied on non-punitive and non-physical methods of 
control 

— a moderate to high emphasis was placed on child independence, 
self-caxe and socialization goals 

U J 
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—dramatic materials were frequently used 
— the teacher reported a low program emphasis on language 
development 

— classroom atmosphere reflected high teacher continuity^ a low 
pupil/teacher ratio and low pupil turnover rate 

— the teacher had achieved a lower level of formal general 
education 

•^•'or older children (over five years of age) cognitive and social 
gains were greatest in prograjns where; 

— a strong emphasis was placed on language development 
— dramatic materials were extensively used 

—adequate large muscle equipment was available but not used 
excessively 

— the teacher had achieved a lower level of formal general 
education 

•For urban children cognitive and social gains were greatest in 
programs where: 

—the teacher used non-pvinitive , non-physical methods of control 
— a moderate to strong emphasis was placed on child independence 

and self -care 
— dramatic materials were extensively used 

—the classroom atmosphere reflected a high degree of teacher 
continuity, a low pupil/teacher ratio, and low pupil turnover 
rate 

— the teacher had achieved a comparatively lower level of formal 
general education and experience with disadvantaged children 

.For non-urban children cognitive gains were greatest in programs 
where : 

—adequate large muscle equipment was available but not 
excessively used 



Significa nce of the program-child interactions . The picture that 
emerges from these analyses reflects the complexity of the inter- 
relationships among children^ teachers, facilities, and programs. The 
findings suggest that there mav be two constellations of program vari- 
ables which are differentially effective for different groups of 
children. l6/ 



16/ Technically, the relationships listed hold for each separate program 
element and each child characteristic individually. No direct test 
was made of the specific combinations of program elements (e.g., 
programs which had large muscle equipment and strong langusige 
emphasis), but there is no reason to believe that clusters of program 
elements would produce effects different from those of the elements 
considered individually. 
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The fi3?st cluster includes prograjn elements related to the order- 
liness and structure of the total environment of the Head Staxt center. 
Specifically, these elements include i high quality facilities (as 
reflected in the availability of expensive large muscle equipment); 
low pupil/teacher ratios; high teacher continuity; and moderate to 
strong program emphases (including aceuiemic emphases), This program 
cluster seems to "be especially beneficial for children who begin Head 
Start with a relatively low pretest IQ (below 85) t children who are 
older, and children from both urbgui and non-^urbgui areas • 

The second cluster includes program eJieihents more related to the 
social and interpersonal climate of the class . These includes teacher's 
use of non-physical methods of control; moderate, socially-oriented 
program emphases (on independence, self-care and socialization); a 
stable classroom atmosphere characterized by high teacher continuity^ 
low pupil/ teacher ratios and low pupil turnover rate; and more frequent 
use of dramatic (role playing) activities. Children who especially 
benefit from these program/teacher inputs had high pretest IQ's 
relative to their Head Start counterparts, entered Head Start at an 
earlier age (under five years) and tended to live in urban axeas. 1?/ 

These two clusters of prograjn elements are reminiscent of the two 
hypothetical clusters of program variables identified by Stodolsky 
(1971) in a review of the results of one site research and evaluation 
reports. Stodolsky, however, reports that only the first of the clusters 
results in significant gains; these findings indicate that both program 
types are effective, but for different kinds of children. 

It should be noted, however, that these two clusters do overlap 
somewhat • They do not identify two completely distinct and independent 
prograjn orientations which are beneficial for two different and non- 
overlapping populations of children, A direct experimental test of the 
effectiveness of these two program clusters, with random assignment of 
children varying in IQ, age and urbanlcity, is needed before the pattern 
observed in these data is adopted as a sound educational policy and 
basis for program prescriptions. 

17/ In the national Head Start program, and in these evaluation data, 
child's age, IQ, urbanlcity, and geographic region are confounded. 
In the South, few school districts have public kindergartens and 
thus the children in the South are four and five year olds while 
children from other regions are three and four yeeir olds. IQ and 
age are negatively correlated in this sample. However, this probably 
does not reflect a decline in IQ performance with age. Vander Riet 
(1972) found that within a Southern sample, cross-sectional data 
showed a grade level decline in IQ while longitudinal data did not. 
It seems more likely that the frequently reported lower performance 
of children from the South (even when matched for age, income, and 
ethnicity) on a variety of achievement measures is responsible for 
the age/lQ correlation observed repeatedly in National Head Steirt 
data. 
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Notwithstanding these caveats, the similarity of these program 
clusters to the recommendations of most early childhood educators is 
striking. Essentially, the findings suggest that young, bright 
children who live in high resource areas (urban) benefit most from a 
preschool experience that encourages motivational independence and 
opportunities to initiate creative activities, while older and less able 
youngsters who live in less well endowed environments, benefit more 
from the program structure, clcLSsroom materials, and opportunities to 
develop specific skills ~ the difference between activities in a 
nursery school and those in a kindergarten. These data do not indicate 
what will benefit the younger and less able child or the older and 
brighter child since age and ability are confounded. As in the 
preceding instance, a direct experimental study is needed to provide 
hajrd evidence on this issue. 
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DISCUSSION 



Factors Influencing Developmental Changes 

Most significstnt in the Head Start evaluation data collected from 
1966-1969 are the documentation of a broad range of developmental gains 
over a broad range of Head Start programs and the implications concern- 
ing which factors influence differences in program effectiveness. 
Since no control data were obtained, these findings shoiid be construed 
only as statistical associations and not causal relationships between 
program inputs and child development outcomes. Nonetheless, these 
evaluation data do provide considerable information about the relative 
benefits of different types of Head Start programs for different types 
of children. From a research perspective, the data contribute most to 
the generation of increasingly specific hypotheses about early child- 
hood intervention. From a policy perspective, the findings also provide 
some tentative guidance for early intervention programs and evaluation. 

In the design of these evaluation studies, four types of factors 
were examined for their relative contribution to the developmental chang( 
associated with participation in the full-year Head Start classrooms. 
These factors were; family background characteristics, child charac- 
teristics, program and teacher characteristics, and combinations of 
program/teacher and child characteristics. The nature of the influence 
of these factors will be reviewed separately below. 

Family ba ckground characteristics . Although family background 
characteristics were shown to be an important determinant of initial 
levels of performance, their influence on changes in performance once 
in the program was negligible. The family characteristics most 
strongly associated with initially higher levels of performance reflect 
the dynamic aspects of family life ~ the quality of child's experience 
in the family and the nature of the parent-child relationship. These 
family characteristics included high parentsuL aspirations of expecta- 
tions for the child, a sense of confidence about life, adult accessi- 
bility to the child, and restraint in using physical punishment as a 
means of disciplining the child. 

Within the narrow range of socio-economic status of the families 
studied, one characteristic was positively associated with changes made 
in the program ~ the mother's level of education. This single finding 
is important, however, since it is consistent with those reported for 
most early intervention programs: program effects are smaller (and 
lost earlier) among the more deprived children (see Ryan, 1974; 
Bronfenbrenner, 197^). However, it should be mentioned that with the 
single exception of mother's education these evaluation data showed a 
more general, though statistically non-significant, tendency for the 
more deprived children to show the greatest gains. On balance, the 
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findings with respect to socio-economic status and gains are problematic. 

Characteristics of the child . Compared to the family background 
characteristics discussed above, background characteristics of children 
(i.e., age, sex, ethnicity, urbanicity and geographic region) were un- 
related or inconsistently related to initial differences in performance 
levels. But unlike family characteristics, some child characteristics • 
were found to be important determinants of performance changes made 
during the program. Of the attributes measured in these evaluation studies, 
child's relative age (i.e., below or above five years) and pretest IQ 
level (below 85; 85-95; or above 95) were significantly associated with 
gains on several measures of social and cognitive development. 

Younger children gained more in social adjustment and general cogni- 
tive ability whereas older children gained more in the acquisition of 
specific skills. These findings are consistent with the view expressed by 
many child development professionals that what a child gains from a pro- 
gram depends on the one hand, upon his competence level and stage of 
development and, on the other hand, the nature of the experiences provided 
to him/her In the program. These data show differing patterns of gains for 
older and younger children and do not support the conclusion that younger 
children gain more across-the-board. Bearing in mind that these data 
sample a fairly restricted age range of children (approximately two and 
one-half to six years) and are confounded with other variables measured 
in these studies, they do suggest that relative age of the preschool child 
is less a determinant of size of gains than a determinant of type of gain 
made in the program. 

.. The findings with respect to the relationship. between child's 
initial level of IQ and gains are especially problematic because of th= 
presence of testing artifacts, as discussed previously. The findings are 
in agreement with those commonly reported in other intervention research 
and show the expected greater IQ and achievement gains for children with 
the lowest entering IQ's. But one can argue, as Bronf enbrenner (1974) has, 
that the data are not sufficient to claim that the least competent chil- 

most from early intervention. As Bronfenbrenner has pointed out 
the existing evidence is based on studies which have not adequately con - ' 

U^i^^tn°''iO'^^^^u^?'' ^^^^^""^^^ exception of a study bv Herzo^ 

{see Ryan, 1974) that shows no long term differences between Initially 

high and low IQ groups. Yet, in the 1966-69 evaluation findings, the 

relationships between initial IQ levels and gains are so strikingly strong 

and linear that they lend some credence to the position that, despite 

testing artifacts, less competent (in terms of IQ) children do seem to 

make greater real gains, especially in cognitive areas. 

. Teacher and Program ch aracteristics . The major determinants of 
gains made in the Head Start program include a multiplicity of teacher 
and program characteristics. Teachers' background, teacher's approaches 
and program emphases, teachers' behavior in the classroom, classroom 
activities, equipment, materials, and classroom atmosphere were all 
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related to child development gains in preschool achievement, general 
cognitive ability, learning ability and social ad jiistment. 

In general, moderate and high levels of cognitive inputs or 
emphases yielded greater gains in cognitive ability and preschool 
achievement. This finding is consistent with those reported by 
Bronfenbrenner (l97^) ajid Steams (l97l) for other preschool interven- 
tion programs, aind supports the position that "cognitive oriented" or 
"cognitive structured" programs produce greater cognitive gains. It 
should be noted however, that the precise meaning of cognitive structure 
varies from program to program aind study to study, aind that in the 
context of most personal-social oriented Head Stsirt programs, cognitive 
structure seems to refer to teachers' emphasis on structured lessons, 
cognitive materials, and small group activities. 

It is noteworthy that these findings confirm that program elements 
or curricula do make a difference to program impact, both in terms of 
magnitude eind type. The influence of program variables is shown in two 
ways in these data. First, the overall pattern of gains parallels, 
almost exactly, the pattern .".f classroom activities that chaxacterized 
most Head Start programs, i.e., the extensi veness of a particulax 
program emphasis and the magnitude of gains corresponded for achieve- 
ment motivation, social adjustment, school readiness, academic achieve- 
ment, and finally, cognitive abilities. Second, more differentiated 
analyses revealed that eyen within the predominajit personal-social 
context of most Head Start classes, particular program elements were 
differentially associated with gains, e.g., more cognitively oriented 
programs tended to produce Isirger cognitive gains. 

These data also provide a tentative and partial answer to the 
question posed eaxlier on the relative contribution of program emphases 
or the ease of niodifying certain areas to differential gains in motiva- 
tion and cognitive abilities. The data show that program emphases do 
account for the greater gains in motivation and achievement as compared 
to gains in cognitive abilities. However, the research designs of 
these evaluation studies do not preclude the alternative interpretation 
that motiVc-\tion and achievement were more easily modified by early 
intervention than cognitive abilities. Interestingly, however, Ih'- lata 
do demonstrate that certain child characteristics (e.g., age, initl-l 
IQ level) modeirate the effects of prc^ram, emphases . 

The findings with respect to th( influence of teacher and program 
variables on developmental change shed less light on the issue of 
specificity versus equality of effects. Whether specific program 
approaches are associated with the corresjonding types of outcomes or 
with a variety of outcomes is not clearly resolved by these findings. 
Global appraisals of the first y^^ar's findings of the Planned Varia- 
tion study (see Eissell, 1971 ) reported comparable gains across all 
well-implemented curricula models. The largest differences in effects 
were found not between models, but between well-implemented curricula 
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models and more loosely formulated traditional Head Start programs. 
Bissell (1971) noted that fine grained analyses of the well-implemented 
prograjns did show specific but small effects corresponding to particular 
program emphases. Since most of the Head Start programs included in the 
1966-1969 evaluation studies were loosely formulated, traditional Head 
Start programs, the finding of specific effects is misleading; the 
primary question of whether a given program emphasis was more effective 
or merely better implemented cainnot be answered by these data. 

Specific program/child combinations . The last factor to be con- 
sidered as a significant influence on performance gains are the 
program/child interactions — specific groups of children who benefitted 
from particular types of program approaches. The interactions reported 
in these studies are few in number and small in size compared with the 
effects of program variables when considered independent of child 
variables. For this rea^^on the interactions should be construed as 
suggestive rather than prescriptive of different educational treatments 
for different kinds of children. 18/ 

In her description of two hypothetical types of early interven- 
tion programs, Steams (l97l) distinguishes between traditional pre- 
school programs and experimental research projects. She concludes that 
the former produce smaller gains and more interactions between indivi- 
dual and program characteristics, a tendency bom out in these evalua- 
tion findings and consistent with the formulation of Head l^laxt as a 
broadly based child development program. 



Implications for Barly Childhood Intervention 

^ In this section of the report the implications of these evaluation 
findings for early childhood intervention research-and programs will be 
presented and discussed. The order of presentation will parallel the 
preceding discussion of the findings. 

.Research and pro gram model development should be directed - 
at exploring alternative strategies of family-centered 

18/ The i nterpretation of the interactions in this study deserves addi- 
tional comment. It must be noted that the interactions were un- 
covered by an analytic model which was especially sensitive to their 
occurrence and which may have generated many "false positives/' 
M; aetheless, the interactions were consistent with the findings fr-r 
' dividual characteristics i;alcen one at a time. It is unfortui,-^-t,<l 
tnat more replication of these important findings was not possiMe, 
The use of somewhat different measures across the three years of 
study and of somewhat different statistical techniques across the 
two reports, makes the namber of direct replications relatively 
small . 
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Intervention t This recommendation is supported by the find- 
ings that the family is a major determinant of what the child 
brings to the program, which in turn influences the benefits 
he or she derives from the prograim. These paxticulax findings 
also point out characteristics of the parent-child relationship 
which are related to differences in child competence upon entry 
and which sire potentially manipulable through family-interven- 
tion strategies, e.g., parental discipline techniques, frequen- 
cy of reading to the child, adult accessibility. These findings 
also suggest that child-oriented interventions should explore 
new ways to build upon and enhance parents' skills as effective 
educational and socializing agents. 

. Children from a wide variety of circumstsmces are likely to 
benefit in the short term from early intervention programs like 
Head Start . The findings reported here show that virtually all 
children gain from their Head Start experience, althoiagh some 
children gain more than others or gain on different outcome 
variables. It seems likely that the broadly-based and multi- 
aimed objectives of most Head Start programs provide a 
sufficient diversity of experience to match some developmental 
needs of most of the children attending the program. The find- 
ings illuminate the importance of systematic assessment of the 
developmental needs of the entering population of Head Start 
children with a view toward improving the individ\ialization of 
Head Start experiences. 

.There i s no one best proCTam or curriculum approach fox^ all 
children; program approaches should be matched to the child 
and the teacher . Within the range of program variations and 
children studied in these evaluations, no single program 
approach proved to be superior for all children across all 
outcome domains. What a child brings to a program influences 
the way he is affected by his Head Start experiences. Further, 
the experiences which influence his cognitive development may 
be different' from the experiences which influence his personal- 
social development. This means that studies averaging gains 
over classes and children, or focusing on only one kind of 
outcome are likely to misrepresent the most significant 
effects. In terms of early educational practices, the find- 
ings imply a need to develop techniques for fine-grained 
developmental assessment, to verify the effectiveness of 
individualized programs, and then to develop feasible ways of 
providing such programs on a large scale, if a focus on early 
child development continues to be a national priority. 

In terms of early intervention research, the findings imply a 
need for more investigation of the interactions between 
teacher characteristics (including teacher style) and child 
characteristics, and more precise identification of teacher 
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skills and attributes which are likely to enhance children's 
gains. It should be noted that the Child Development Associate 
effort is currently addressing some of these issues. 

.Improved -program planning, more -precise articulation of program 
ob.jectives. and closer approximation of practice to child 
development theory, should improve the matching of program 
experiences to the needs of individual children and should 
increase overall program e-ffectiveness . The interactions 
between child and program characteristics and the specificity 
of effec-ts for certEiin program elements underscore the 
importance of more rigorous program planning on the one hand, 
and the assessment of children's needs, on the other. The 
findings with respect to child/program interactions also point 
to -the continuing need to achieve congruence between program 
objectives, child development outcomes, and objective outcome 
measures . 



Summary 



As a concluding reminder, the reader is urged to consider the 
findings from these national evaluations in the context of the objectives 
of Project Head Start, as stated in the Cooke Memorandum. These objec- 
tives axG heavily weighted in favor of social and emotional growth; 
however, the most significant relationships reported here are weighted 
in favor of achievement motivation, skills, and cognitive growth. 
Furthermore, of the many components of Head Start, only one (center 
classroom; T^as used as a data basis for these evaluations. 

Apart from the obvious need to develop measurement instruments 
appropriate for non-cognitive objectives, this "mismatch" between 
program objectives and assessment may illuminate the possibility that a 
broad program can never be fairly evaluated in its entirety. Just as 
these data have shown that some centers do better at advancing cognitive 
development, it is likely that some centers have more successful career 
development and training programs than others. No single evaluation can 
ever capture all that is Head Start. 
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APPENDIX A 



Measure Selection and Psychometric Characteristics 



Child and family Measures 1968-1969 

Information available on the child included pre- and post-test 
scores on the following instruments: Stanford-Bine t, Inventory of 
Factors Affecting the Stanf ord-Binet, Hertzig-Birch Response Style, 
Gumpgookies (achievement motivation). Sociometric Play Situation, 
Animal House test: form the Wechsler Preschool and Primary Scale of 
Intelligence, and the Caldwell-Soule Preschool Inventory, Extensive 
demographic information was available for all children on a master data 
card. This included the child's age, sex, geographic region, urbanicity, 
ethnicity, and so forth. 

1. Cognitive Development Measures 

Stanford-Binet iq. The Stanford-Binet was developed as an 
intelligence test, as a measure of the individual's 
ability to learn from experience. This interpretation 
requires that the individual tested have been exposed to 
experiences similar to those of individuals on whom the 
measure was standardized. In the 1957 revision of this 
test, the standardization sample over- re presented white 
and more advantaged children. Even if the sample had been 
representative of the ethnic and economic circumstances of 
children in the general population, the performance of 
any group whose life experiences diverged from the norm 
sample as a whole, would have to be interpreted as an 
indication of both experience per se and the ability to 
learn from experience. The higher IQs of more advantaged 
children do indicate a greater ability to perform certain 
tasks, but that ability is related both to possible 
differences in learning capacity and to certain differences 
in life circumstances that favor performance on such test 
items. The lower IQs typically achieved by children from 
economically constricted circumstances indicate less 
ability on the particular skills measured by the Binet, 
but again, the level of performance is due both to possible 
differences in learning capacity and to certain differences 
in life expeilences that do not favor performance on test 
items. In addition, scores on this and other individually 
administered tests depend heavily on the child's coopera- 
tiveness, motivation, and rapport with the examiner. 

As a measure of performance, the absolute level of the 
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Stanford-Binet scores has repeatedly bsen found sensitive 
to intervention of many kinds; the relative position of 
individual scores within a group exposed to the same 
intervention changes much less. Where the intervention 
is very (effective, the individual differences within 
groups may actually be increased; as children become more 
able to learn they in turn respond to new opportunities 
for learning. The Binet frequently has been reported to 
predict school achievement as well as rate of leaxning 
on a variety of new tasks. 

Animal House . This subscale of the Wechsler Preschool and 
Primaxy Scale of Intelligence is similar to the auiult 
Digit-symbol or Coding tests. The materials consist of a 
board with pictures of four different animals arranged in 
rows with spaces for "houses" beneath each picture, and a 
supply of colored pegs ~ the "houses", A key at the top 
of the board identifies the color for each animal's house, 
and the child's task is to correctly place a house under 
each animal on the boaxd. The raw score is a combination 
of time, errors, and omissions; an aige-scaled score may 
also be derived. The WPPSI is a standardized test of the 
ability to learn, and the Animal House subtest was selec- 
ted as a relatively culture-free, nonverbal test of the 
ability to leaxn a new task. 

Preschool Inventory , The Caldwell -So ule Preschool 
Inventory was developed in 1965 specifically to assess the 
level of cognitive maturation of Head Start children. It 
seems to measure cognitive functioning at a more concrete 
level than the Stanford-Binet, and is interpreted as a 
measure of achievement or school readiness. There are four 
subtests to the seals: Social Responsiveness, Associative 
Vocabulary, Concept Activation/Numerical, and Concept 
Activation/Sensory, Scores on the subscales are highly 
intercorrelated. Although the form used in 1968-69 was 
then an experimental version, it is now a standard, 
commercially avai.lable instrument. Data from the 1968-69 
pretesting provided the national standardization data for 
xow-income children, 

Social-Emotional Development Measures 

Inventory of Factors A ffecting the Stanford-Binet , This 
instrument is a modification of the fact sheet of the 
Stanford-Binet record form. The examiner rates the child 
on his behavior during the Stanford-Binet testing session, 
assessing such traits as compulsivity, concentration, and 
activity level. This adaptation was prepared by Dr. 
Herbert Zimiles of the Bank Street College of Education, 
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Hertzlg-Blrch Response Style . This instrument, like the 
Inventory of factors Affecting the Stanford-Binet, is 
given in conjxinction with the Stanford-Binet, and was 
designed to measure various cognitive styles that the 
child employs when responding to test items. For example, 
a distinction is made between a wrong response where the 
child "works at" a problem solution and a wrong response 
where the child does not, Hertzig, et.al. (I968) have 
demonstrated differences in these cognitive styles among 
vaxious ethnic and socio-economic groups, 

Gumpgookles > The Gumpgookies test is an experimentatl 
measure of achievement motivation designed for preschool 
children (Adkins and Ballif, 1970) • The form used for 
initial testing had 100 items; the post- test version had 
only 55 items. The reduction in length wsis required both 
by the short attention span of the children and by 
psychometric shortcomings of some items. Each item 
consists of a picture of two pillow-like creatures, one of 
which is doing something that is more achievement oriented 
than what the other one is doing. For example, one figure 
might be successfully completing the building of a block 
tower while the other was sitting in a heap of fallen 
blocks. The child is asked which Gumpgookie is his 
Gumpgookie (i.e. the one that acts like he does). The 
internal consistency reliability for the test is .76. 
Although Gumpgookies scores do discriminate between 
children rated by their teachers as high or low in 
motivation, the scores are also influenced by idiosyn- 
crasies in responding, such as position preference. 

Sociometrlc Play Situation , In this task designed by 
Dr. Robert Boget, each child selects (a) three play 
situations depicted on six cards, and then (b) the 
photograph of a preferred classmate to put into each of 
the three play situations. Thus, measures "of popularity 
and isolation can be derived for each child. The 
structure of the task resulted in arUfactual pre-post 
correlations; if some children's relative popularity 
increased, the remaining children's relative popularity 
had to decrease. 

Parent and Fajnily Measures 

Parent I nterview . Pre- and post-interviews with the 
parent (usually the mother) yielded information about 
potential variables includingi the family's socioeconomic 
status (family income, a father's occupation, mother's 
education, father's education); aspects of family life 
(family structure, frequency of reading to the child, 
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amount of living space, pstrent participation in the Head 
Start program); parents' attitudes toward society in 
general (expectations for the child's education, aispira- 
tions for the child's education, feelings of alienation)* 
In addition, the interview presented many open-ended 
questions intended to reflect typical styles of socisuL 
interaction between parent aJid child. The variables raid. 
measxires pertaining to parents' attitudes and parent-child 
interaction merit greater elaboration below. 

The measures of parental attitudes toward, society and life 
in generaJ. are interpreted as "locus-of -control" measures-, 
The individual with a sense of internctl control believes 
that he has the power to control his own life a>i-:2 to 
influence the lives of others; the individual vCth a sense 
of e xternal control beli eves he has littlfc cc^nand over 
what happened to him euid his faMly. Scores*; on tests of 
locus-of-control have been found to predict which indivi- 
duals from disadvantaged circumstances will be able to 
overcome their environmental constrictions. For example, 
Coleman (1965) found a positive correlation between high 
test performance and internal control in Black children; 
in other studies, changes in. locus-of -control have been 
associated with changes in achievement. Nonetheless, the 
direction of a causality is an unresolved question: are 
"internals" more accurate in rating their influence axid 
power because they really are more competent, or are they 
able to take advantage of opportunities simply because 
they believe they can? 

Information about parent-child interactions was derived 
from four questions about the parents' disciplinary 
practices. One question asked the mother to name one of 
the worst things that her Head Start child does and 
explain how she responds to it. The second question asked 
about "little annoying things" and the mother's response; 
questions three and four related to child behaviors which 
the mother liked and liked "a great deal." The responses 
were categorized according to the means by which parents 
modified or reinforced the child's ongoing behavior. The 
broad categories werei non-intervention, intervention, 
qualified power assertion, and unqualified power asser- 
tion. This measure was developed by Dr. Irving Siegle 
and Dr. Bela r'ehrer, with Dr. Fehrer supervising the 
coding of the present data. Similar items have been 
found to predict preschool achievement. 

Another open-ended question asked the mother was, "What 
would you tell your child to do on the first day of 
school?" This item measui*es how the mother views the 
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teacher, classroonii social situations, achievement, home 
preparation, and personal safety. The mother's responses 
Were categorized by trained raters under the direction 
of Dr. Robert Hess. These ratings have been shown to be 
sensitive to social class differences within samples of 
Black children, aind to pred^ •^t both i :;eschool aind later 
primary school achievemen-?- 



Teacher and Pro ^g rajn Measures 1968-1^6? 

1. Teacher Self -Reports 

T^-cb gher Interview . Questions on this instrument 
explored prograjn focus, importance of child development 
goals, description r>f teaching approach, and kinds of 
Commercial curriculum materials used by the teacher. 
It was developed as an independent reliability mestsure 
for this intervention studies of 1968-69 by Dr. Carolyn 
^tem. 

Checklist of Administrative Veiriables , This checklist 
Was designed, to determine the person(s) responsible for 
each of the 31 key functions or activities in full-year 
Head Start programs. Examples of these functions 
included preparing the agenda for teachers' meetings 
and planning parent orientation sessions. The check- 
list was prepared by Dr. Shuell Jones as a meats ure of 
the quality of icograsa administration as seen by Head 
Start teachers. 

Characteristics of Teaching Staff Form . This question- 
naire, prepared by Dr. Carolyn Stem, was administered 
at the end of the program year and provided information 
regarding the head teacher, teacher, and teacher aidei 
age, sex, ethnicity, education, training, experience. 

C^lass Facilities aind Resources Inventory . This form is 
similar to that used in census surveys to determine 
compliance with Head Start guidelines. The class 
information included the operating length of the program, 
length of class day, number of children in class, and the 
Various educational and physical resources available to 
the children. 

2. Observations of Classroom Activities 

Post Observation Teacher Rating Scale . This instrument 
consisted of 33 ratings of teacher behavior made by the 
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same observer and on the same day that OSCI observations 
were obtained (see below) , Ratings incliided such teacher 
behavior as her reliance on on-going activities, her 
attention to groups, and her awareness of pupil frustra- 
tion. This scale, developed by Dr. John Dopyera, is 
based on the work of Harvey, Hunt, and Schroeder (1967) 
on the "openess" and "closedness" of individual style. 

Observation of Substantive Gurrlcular Input (OSGl ). The 
OSCI was developed by Dr. Carolyn Stem. The instrxiraent 
consists of a complex schedule that is used by a trained 
observer to code the behavior of the class and the 
teacher. 

The OSCI coding system is based on a series of three 
minute scans of or-going activity in the classroom. In 
some of the scans the observer looks at what the 
children axe doing, and in others she looks at the 
teacher and teacher-aide. For each scan, the behaviors 
of the largest gi-oup of children are observed and coded. 
The major codes reflect the context of activity (i.e., 
building), the content of the activity (i.e., mechanical), 
the locus -of -behavior (i.e., teacher or child initiated), 
and teacher involvement (i.e., present and watching)., 
Within the three-minute period, the obsez:ver then 
locates the next largest group, and repeats the coding 
process. A three-minute record could consist of one 
unit (if all the children sure occupied in one activity) 
or of as many imits as there are children (if each child 
is doing something individually) . Three-minute scans of 
teacher activity record teacher context, teacher content, 
and teacher involvement for six 30-second intervals. 

These teacher and classroom observations were collected 
on five separate days spread through the middle and end 
of the program yesir. Each daily observation covered 
most of the school day for the majority of Head Start 
sample classes . Observers attended special three-day 
training sessions and were supervised regularly for 
quality and reliability. 

These voluminous data were analyzed by concentrating on 
selected content-context categories and control categories 
for both class and teacher observations, frequency 
measures were then intercorrelated, factor analyzed, and 
factor rotated. Six rotated factors were retained for 
further analyses in both the class and teacher data. 

The six teacher factors were identified as followss 
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I. Social-emotional Inteiractlon . Partially defined 
"by emotional and social behavior of the teacher# 

II • Structured Lessons — Large Groux) . Partially 
defined "by use of structured laJiguage materials, 
structured visual discrimination lessons, use of 
program materials, 

III* Art Activities , Partially defined "by art content 
in a painting context, suid verbsuL communication in 
an art context. 

IV. Creative Instruction Small Group . Partially 
defined by visual motor activities, small groups, 
presence of large and small muscle equipment. 

V. Routines, Partially defined by teacher use of 
mechanical devices, and teacher activity in routine 
context. 

deceptive Learning . Partially defined by language- 
content in a watching-listening context, visual 
discrimination in an activity context, and presence" 
of language materials. 

The six OSCI class factors were labelled as follows: 

^* Structure d Lessons . High loadings fron presence of 
programmed materials, language through structured 
lessons, control by the head teacher, and large 
group activities, 

II • Group Act ivities and Routines , Partially defined 
by music, drama, and art activities in a social 
context, 

III' Social-e motional Interaction . Partially defined by 
emotional content in all contexts, social content 
in a physical contact context, and control b'. a 
child, 

IV. Verbal Co mmunication , Primarily defined by verbal 
communication in a variety of contexts including 
activities, arts, and routines, 

^- Instruction in Creat ive Arts. Primarily defined hy 
visual motor content in an activities context, 
r^ij.ld control, small groups, presence of dramatic 
piay equipment, and dramatic content in a perform- 
ing context. 
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VI. Lanp;uaKe and Discrimination Leaxnlng , Prim^ily 
defined by the presence of language materials, 
language content in a watchlng-listening context, 
and vj,sual discrimination in an activities context. 



Measures Used in 1967-1968 and 1966-1967 



The data collected in 1967-I968 and 1966-I967 are used lu this 
summary report chiefly for the purpose of indicating replication of 
findings reported for 1968-I969. Therefore, the discussion of the 
measuring instruments for 1967-I968 and 1966-I967 is "briefer and more 
general than for those used in the primary data base, i.e., 1968-I969. 



1. Child and Family Measures 

Stanford-Binet IQ . These 
children in the 1966-I967 



scores were available for most 
and 1967-1968 samples. 



Inventory of Factors Affecting the Stanford-Binet . Scores 
were available for 1967-1968 only. 

Preschool Inventory . Scores were avsiilable for 1966-I967 
only. 

Head Start Behavior Inventory . Behavior ratings made by 
the Head Start teachers were available for 1966-1967 only. 

Parent Interview, Pre- and post-program interviews similar 
to those obtained in 1968-I969 were availa*hle for both 

1966- 1967 and 1967-1968. 

2. Teacher and Program Measures 

Characteristics of Teaching Staff Form . This instrument 
provided data similar to that of the 1968-I969 study for 
the two previous years. 

Center and Classroom Composition Instrument . This question- 
naire provided information similar to that from the Class 
Facilities and Resources Inventory. 

Observation of Substantive Curricular Input (OSCi ) . The 

1967- 196J 03CI was similar to the 1968-I969 OSCI but 
gather^-ed information only on the class, rather than both 
class and teacher. Analysis of the intercorrelations of 
the frequencies of the various class content-context 
categories across classes yielded only four rotated Class 
Factor Scores, rather than six, as in 1968-I969. These 
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four factors axe briefly described belows 

I. High Cognitive Activity in a low structure situa- 
tion . Mostly defined by high loadings of verbal 
instruction during learning activities and verbal 
instruction during routines. 

II • Routines and Rules . Characterized by high loadings 
on rules emphasized during routines, whole group 
activities, and social interaction during learning 
activities. 

III. High Cognitive Activity in a High Structure Situa- 
tion . Partially defined by positive loadings of 
visual discrimination and cognitive input during 
watching or listening, and a negative loading for 
individual activity. 

IV. Child-Centered, Unstructured . Partially defined by 
high positive loadings for social interaction during 
learning activities and high negative loadings for 
mechanicaJL perfoinnance of routines. 

Social Interaction Observation (SIO) . This instrument, 
unique to the 1967-1968 data, was used to gather detailed 
information about child-child and adiilt-child interactions 
in the classroom. Developed at the University of Kansas, 
the SIO records the number of verbal and nonverbal initia- 
tions from the child to an adiilt, the' number of responses 
by the adult, and' the number of adiilt initiations respond- 
ed to by the child. 

Observer Rating Form . This instrument was used only in 
1966-1967 to obtain classroom information. It was 
developed by Dr. John Pierce-Jones and his associates. 
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APPENDIX B 



Selection of Variables for Analysis 

RTI 

Originally, 90 explanatory and outcome variables were considered for 
analysis based on previous reseaxch and experience. Variables from the 
preliminary list were subsequently dropped for a vaxiety of reasons, 
the most common ones being: (l) computational, and cost factors in 
deriving an index from the raw data provided on tape, (2) undesirable 
marginal distributions (e.g., no variation in an explanatory variable), 
(3) low internal consistency reliability of derived or scaled variables 
as measured by Kuder-Richardson Formula 20, and (^) apparent uninterpre- 
tability or extreme redundancy as shown in analyses of the intercor- 
relations and interdependencies among many sets of prelirainaxy outcome 
and explanatory variables. . 

SDC 

The method selected for the screening process was a correlational 
analysis approach which permitted exajnination of laxge numbers of 
variables at relatively low cost* For each dependent variable, the 
Preaxson product-moment correlation was computed between the pre - 
t'^^-t score and each program vaxiable; next the correlation was computed 
}x vween the post - test score and each program variable. The differences 
/Netween those two correlations were computed and a "t" test was per- 
4.'ormed on each pair to test the null hypothesis of "no difference." A 
'::jrTelation increase was interpreted as suggestive of a positive 
relatf.onc:h? p between the program variable and the performance measure, 
and a decrease as a negative relationship. After m,iny variables were 
eliminated frax consideration because the pre-post correlation differ- 
ence did not reach statistical significance, a few were returned to the 
list because of their importance in the literature on their place in 
a priori hypotheses. 



Selection of Analytic Models 

RTI 

To identify effects that were relatively independent of subsetting 
variables, RTI performed a series of four-way ANOVAs in which the 
children's adjusted gain scores were examined when all possible oneway 
effects and two-way interactions (except those of the prime oneway 
vaxiable to be tested) were taken into account. This approach is 
maximally sensitive to the stability of the main effect when several 
subsetting variables are simultaneously tak«n into account. RTI 
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also reported results from multiple regression analyses of classroom 
and teacher characteristics (as measured by the OSCi) used to predict 
children's gains on the Stanford-Binet, Preschool Inventory, and 
Feelings of Inadequacy icale. 

SDC 

Interactions were inferred from the retention or drop-out of the 
statistical significance of a main effect when tested in separate 
subset analyses (for example, high vs. low levels of "Large Muscle 
Equipment" was tested separately for groups defined by initial IQ score 
low, medium, and high). In all of these ANOVAs (over lOOO), a specially 
adjusted post-test store was used. In effect ^ the initial score was a 
covariate, except that the regression weight of the lowest level of the 
program variable was used for all of the calculations. This procedure 
was chosen as appropriate for an exploratory study, in spite of the fact 
thai it does increase the chance of finding spuriously significant 
interactions. 



Differences in Application of Aiiu.lytic Models 
1968-1969 Data 

1. Child Characteristics 

Intellift'ence ( Stanford-Binet IQ) 
^TI . Used as an outcome variable. 

SDC. Used as an outcome variable, and used pretest Stanford- 
Binet score as a subsetting variable for testing inter- 
actions. 

Ability to Lean-^ a New Task (Animal House ) 

RTI. Used raw score as an outcome variable, 
SDC. Used scaled scores as an outcome variable. 

Achievement and Schoo l Readiness (Caldwell-Soule Pr Gsr^hnni inven- 
torj;) 

^n. Used raw score on total PSI as an outcome variable. 
SDC. Used raw scores on the four subscales and an age-normed 
score on the total score as outcome variables. 

Adjustment to a Hov el Situation ( inventor/ Factors Affecting 
the Stanford-Binet "" ^ 

RIT. Used three rationally derived subscales as outconic 

variables: Behavior Problems, Motivational Prohl'.iUt,, 
and Feelings of Inadequacy. 
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SDC. Used total score as an outcome variable interpreted as an 
overall index of adjustment. 

Cognitive Style (Hertzig-Birch Response Style ) 

RTI. Did not use these data. 

SDC* Used three subscores as outcome variables: Work Response 
Score, Verbal Response Score, and Spontaneous Response 
Score • 

AchieVcf -'i Tignt Motivation (Gumpgookles ) 

RTI , Used as an outcome variable 
SDC, Did not use these data. 

Children's Sociability (Sociometric Play Situation ) 

RTI , Did not use these data. 

SDC * Used a measure of social isolation as an outcome variable. 
Parent and Family Characteristics 

Socio-economic Status, Parental Attitudes, Styles of Parent-Child 
Interactions, Household Routines (Parent Interview ) 

RTI. Used these as piedictor variables, 

SDC. Used some fajnily characteristics as outcome variables and 
others as predictor variables. 

Teacher and Program Characteristics 

Propcram Emphasis, Goals, and Teaching Approach (Teacher Interview j 

RTI. Derived several program variables from the teacher responses, 
3DC, Used these responses as a primary indicator of program 
content. 

Administrative Variables (Checklist ) 

RTI, Hade no use of these data. 
SDC. Made no use of these data. 

Teacher* s Background (Characteristics of Teaching Staff ^orm ) 

RTI. Used these variables as predictors, 
3DC. Used these variables as predictors. 
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Proprram Faollitles and Resources (Class Facilities and Resources 
Inventory ) ' ' " 

RTI. Used four (rather global) categories as predictor 
variables . 

SDC * Used two (rather specific) categories as predictor 
variables . 

Teacher's Classroom Behavior (Post Observation Rating Scale ) 

RTI. Used some items as predictor variables. 
SDC. Used some items predictor variables. 

Teacher's and Children's Classroom Behaviors (Observation of 
Substantive Curricular Input ) ~ 

RTI. Used these data as predictor variables and as subsetting 
variables for testing ir.t«jractions. 

SDC . Did not use these data. 
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